Re2 Robotics
Principal Software Engineer
Honeywell
Engineer Software Iv, R and D
Vocollect 2003 - Mar 2014
Lead Software Engineer, Speech Recognition
Vocollect 2000 - 2003
Lead Software Engineer, Custom Systems
Medrad 1999 - 2000
Senior Software Engineer, Rnd
Education:
University of Pittsburgh 2015 - 2019
Masters
University of Pittsburgh 2012 - 2015
Bachelors, Computer Science
University of Pittsburgh 1991 - 1994
Bachelors, Electrical Engineering
Skills:
Software Engineering Cross Functional Team Leadership Software Development Embedded Systems Integration Agile Methodologies Troubleshooting C Testing C++ Debugging
A method for determining a relative position of a microphone may include capturing speech audio from a user's mouth with the microphone so that the microphone outputs an electrical signal indicative of the speech audio; determining an indication of a position of the microphone relative to the user's mouth, which may include providing a plurality of inputs to a computerized discriminative classifier, wherein an input of the plurality of inputs is derived from the electrical signal, and wherein an output from the computerized discriminative classifier is indicative of the position of the microphone relative to the user's mouth.
Distinguishing User Speech From Background Speech In Speech-Dense Environments
A device, system, and method whereby a speech-driven system can distinguish speech obtained from users of the system from other speech spoken by background persons, as well as from background speech from public address systems. In one aspect, the present system and method prepares, in advance of field-use, a voice-data file which is created in a training environment. The training environment exhibits both desired user speech and unwanted background speech, including unwanted speech from persons other than a user and also speech from a PA system. The speech recognition system is trained or otherwise programmed to identify wanted user speech which may be spoken concurrently with the background sounds. In an embodiment, during the pre-field-use phase the training or programming may be accomplished by having persons who are training listeners audit the pre-recorded sounds to identify the desired user speech. A processor-based learning system is trained to duplicate the assessments made by the human listeners.
Systems And Methods For Determining Microphone Position
- Fort Mill SC, US David D. Hardek - Allison Park PA, US
International Classification:
H04R 29/00 G10L 15/01
Abstract:
A method for determining a relative position of a microphone may include capturing speech audio from a user's mouth with the microphone so that the microphone outputs an electrical signal indicative of the speech audio; determining an indication of a position of the microphone relative to the user's mouth, which may include providing a plurality of inputs to a computerized discriminative classifier, wherein an input of the plurality of inputs is derived from the electrical signal, and wherein an output from the computerized discriminative classifier is indicative of the position of the microphone relative to the user's mouth.
Apparatus And Method To Classify Sound To Detect Speech
- Fort Mill SC, US David D. Hardek - Allison Park PA, US
International Classification:
G10L 15/20
Abstract:
Audio frames are classified as either speech, non-transient background noise, or transient noise events. Probabilities of speech or transient noise event, or other metrics may be calculated to indicate confidence in classification. Frames classified as speech or noise events are not used in updating models (e.g., spectral subtraction noise estimates, silence model, background energy estimates, signal-to-noise ratio) of non-transient background noise. Frame classification affects acceptance/rejection of recognition hypothesis. Classifications and other audio related information may be determined by circuitry in a headset, and sent (e.g., wirelessly) to a separate processor-based recognition device.
Apparatus And Method To Classify Sound To Detect Speech
- Everett WA, US David D. Hardek - Allison Park PA, US
Assignee:
INTERMEC IP CORP. - Everett WA
International Classification:
G10L 15/08
US Classification:
704233
Abstract:
Audio frames are classified as either speech, non-transient background noise, or transient noise events. Probabilities of speech or transient noise event, or other metrics may be calculated to indicate confidence in classification. Frames classified as speech or noise events are not used in updating models (e.g., spectral subtraction noise estimates, silence model, background energy estimates, signal-to-noise ratio) of non-transient background noise. Frame classification affects acceptance/rejection of recognition hypothesis. Classifications and other audio related information may be determined by circuitry in a headset, and sent (e.g., wirelessly) to a separate processor-based recognition device.