Soundhound Inc.
Staff Engineer
Tesla Jan 2011 - Mar 2013
Senior Systems Engineer
Soundhound Inc. Jan 2011 - Mar 2013
Engineer
Gravity Probe B May 2004 - Dec 2011
Research Engineer
Stanford University May 2003 - May 2004
Research Assistant
Education:
University of California, Berkeley 2014 - 2017
Master of Business Administration, Masters, Commerce, Business
Stanford University 2002 - 2004
Master of Science, Masters, Mechanical Engineering
Wabash College 1995 - 1999
Bachelors, Bachelor of Arts, Mathematics, German
The University of Göttingen 1997 - 1998
University of California, Berkeley, Haas School of Business
Skills:
Matlab Labview Simulink Simulations Engineering Python Testing Mechatronics Mechanical Engineering Solidworks C Robotics Sensors Foreign Languages Embedded Systems Pcb Design Electronics Algorithms R&D Product Development Perl Design For Manufacturing Systems Engineering Engineering Management Automation Electrical Engineering Firmware Manufacturing Automotive Catia Design of Experiments Aerospace C++ Software Development
Stanford University Mar 2002 to Jun 2004 M.S. in Mechanical EngineeringWabash College Sep 1995 to Jun 1999 B.A. in Mathematics and GermanAugust Universitt Gttingen Sep 1997 to Aug 1998
John Ramunas - Stanford CA, US Juan G. Santiago - Stanford CA, US Helen M. Blau - Menlo Park CA, US Keith T. Wong - Vancouver WA, US Viktor Shkolnikov - Chatsworth CA, US Karl Stahl - Menlo Park CA, US Khaesha Hall - Stanford CA, US Kevin Nam Truong - Monterey Park CA, US Jason L. Chua - Glenview IL, US
International Classification:
F16K 31/12 F16K 31/48
US Classification:
13756501, 13762418
Abstract:
Provided is a sequenced fluid control device that includes a pneumatic driver module, a fluid cartridge having at least one fluid chamber and at least one waste chamber, and at least one sample delivery element disposed to sealably contact the fluid cartridge forming a main chamber containing a sample, where the fluid cartridge is also disposed to sealably contact the pneumatic driver module. The pneumatic driver module injects gas into or withdraws gas from the fluid cartridge, where the gas directly contacts at least one fluid causing at least one fluid to flow through channels disposed to connect at least one fluid chamber to the sample delivery element and disposed to connect the sample delivery element to the at least one waste chamber or waste channel, where a sample on the sample delivery element is exposed to the at least one fluid.
Enabling Natural Language Interactions With User Interfaces For Users Of A Software Application
A user specifies a natural language command to a device. Software on the device generates contextual metadata about the user interface of the device, such as data about all visible elements of the user interface, and sends the contextual metadata along with the natural language command to a natural language understanding engine. The natural language understanding engine parses the natural language query using a stored grammar (e.g., a grammar provided by a maker of the device) and as a result of the parsing identifies information about the command (e.g., the user interface elements referenced by the command) and provides that information to the device. The device uses that provided information to respond to the command.
Voice Characterization-Based Natural Language Filtering
- Santa Clara CA, US Karl Stahl - Menlo Park CA, US
Assignee:
SoundHound, Inc. - Santa Clara CA
International Classification:
G10L 15/19 G10L 15/18 G10L 25/63 G10L 17/02
Abstract:
An utterance is analyzed to determine a characteristic of the utterance and a transcription hypothesis is generated for the utterance. Grammar rules are then used to parse the transcription hypothesis to produce a plurality of interpretation hypotheses, each having a likelihood score. A set of authorized domains is determined based on the characteristic and the plurality of interpretation hypotheses are filtered according to the set of authorized domains. Of the remaining interpretation hypotheses, one is selected according to their likelihood scores. The characteristic may include one or more characteristics such as mood, prosody, or whether the utterance has a rising intonation.
Multi-Modal Audio Processing For Voice-Controlled Devices
A voice-controlled device includes a microphone to receive a set of sound waves that includes speech uttered by a user and other sound, and to output a first audio signal that includes a contribution from the speech uttered by the user and a contribution from the other sound. The device also includes a receiver to receive an electromagnetic signal and to output a second audio signal obtained from the electromagnetic signal. An audio pre-processor of the device processes the first audio signal using the second audio signal to reduce the contribution from the other sound in a processed audio signal. The voice-controlled device then provides the processed audio signal to a speech recognition module to determine a voice command issued by the user.
- Santa Clara CA, US Karl Stahl - Palo Alto CA, US
International Classification:
H04R 1/10
Abstract:
A speaker device includes an electroacoustic transducer configured to convert an audio signal into a set of sound waves and a transmitter configured to transmit an electromagnetic signal that carries the audio signal for receipt at distances limited to an audibility range of the set of sound waves. The audibility range of the set of sound waves corresponds to a distance at which the set of sound waves is estimated to be below a predetermined sound level.
Automatic Synchronization For An Offline Virtual Assistant
[Object] Technology is provided to enable a mobile terminal to function as a digital assistant even when the mobile terminal is in a state where it cannot communicate with a server apparatus.[Solution] When a user terminal receives a query A from a user, user terminal sends query A to a server . Server interprets the meaning of query A using a grammar A. Server obtains a response to query A based on the meaning of query A and sends the response to user terminal . Server further sends grammar A to user terminal . That is, server sends to user terminal a grammar used to interpret the query received from user terminal
- Santa Clara CA, US Irina A. Spiridonova - Campbell CA, US Karl Stahl - Menlo Park CA, US
Assignee:
SoundHound, Inc. - Santa Clara CA
International Classification:
G10L 15/22 G10L 15/18 G10L 15/19 G10L 15/30
Abstract:
The technology disclosed relates to retrieving a personal memo from a database. The method includes receiving, by a virtual assistant, a natural language utterance that expresses a request, interpreting the natural language utterance according to a natural language grammar rule for retrieving memo data from the natural language utterance, the natural language grammar rule recognizing query information, responsive to interpreting the natural language utterance, using the query information to query the database for a memo related to the query information, and providing, to a user, a response generated in dependence upon the memo related to the query information.
System And Method For Detection And Correction Of Incorrectly Pronounced Words
A system and method are disclosed for capturing a segment of speech audio, performing phoneme recognition on the segment of speech audio to produce a segmented phoneme sequence, comparing the segmented phoneme sequence to stored phoneme sequences that represent incorrect pronunciations of words to determine if there is a match, and identifying an incorrect pronunciation for a word in the segment of speech audio. The system builds a library based on the data collected for the incorrect pronunciations.
Isbn (Books And Publications)
Angst Vor Der Zukunft, Versuche Zur Hoffnung: Berichte, Essays, Vermutungen