Peter S. Cardillo - Atlanta GA, US Mark A. Clements - Lilburn GA, US William E. Price - Smyrna GA, US
Assignee:
Georgia Tech Research Corporation - Atlanta GA
International Classification:
G10L 15/00
US Classification:
704236, 704231, 704249, 704255
Abstract:
An improved method and apparatus is disclosed which uses probabilistic techniques to map an input search string with a prestored audio file, and recognize certain portions of a search string phonetically. An improved interface is disclosed which permits users to input search strings, linguistics, phonetics, or a combination of both, and also allows logic functions to be specified by indicating how far separated specific phonemes are in time.
Peter S. Cardillo - Atlanta GA, US Mark A. Clements - Lilburn GA, US William E. Price - Smyrna GA, US
Assignee:
Georgia Tech Research Corporation - Atlanta GA
International Classification:
G10L 15/08
US Classification:
704236, 704 7, 704 4, 704 5
Abstract:
An improved method and apparatus is disclosed which uses probabilistic techniques to map an input search string with a prestored audio file, and recognize certain portions of a search string phonetically. An improved interface is disclosed which permits users to input search strings, linguistics, phonetics, or a combination of both, and also allows logic functions to be specified by indicating how far separated specific phonemes are in time.
Peter S. Cardillo - Atlanta GA, US Mark A. Clements - Lilburn GA, US William E. Price - Smyrna GA, US
Assignee:
Georgia Tech Research Corporation - Atlanta GA
International Classification:
G06F 17/30 G06F 7/00 G10L 15/00
US Classification:
707 4, 704251, 707 3
Abstract:
An improved method and apparatus is disclosed which uses probabilistic techniques to map an input search string with a prestored audio file, and recognize certain portions of a search string phonetically. An improved interface is disclosed which permits users to input search strings, linguistics, phonetics, or a combination of both, and also allows logic functions to be specified by indicating how far separated specific phonemes are in time.
Robert W. Morris - Atlanta GA, US Jon A. Arrowood - Smyrna GA, US Marsal Gavalda - Atlanta GA, US Peter S. Cardillo - West Stockbridge MA, US Mark Finlay - Tucker GA, US Zahi Karam - Cambridge MA, US
Assignee:
Nexidia Inc. - Atlanta GA
International Classification:
G10L 13/00
US Classification:
704258, 704 7, 704270
Abstract:
An approach to improving the performance of a wordspotting system includes providing an interface for interactive improvement of a phonetic representation of a query based on an operator identifying true detections and false alarms in a data set.
Peter S. Cardillo - Atlanta GA, US Mark A. Clements - Lilburn GA, US William E. Price - Smyrna GA, US
Assignee:
Georgia Tech Research Corporation - Atlanta GA
International Classification:
G10L 15/06 G10L 21/00 G06F 17/30
US Classification:
704243, 704254, 704275, 707746, 707760, 707801
Abstract:
An improved method and apparatus is disclosed which uses probabilistic techniques to map an input search string with a prestored audio file, and recognize certain portions of a search string phonetically. An improved interface is disclosed which permits users to input search strings, linguistics, phonetics, or a combination of both, and also allows logic functions to be specified by indicating how far separated specific phonemes are in time.
Peter S. Cardillo - Atlanta GA, US Mark A. Clements - Lilburn GA, US William E. Price - Smyrna GA, US
Assignee:
Georgia Tech Research Corporation - Atlanta GA
International Classification:
G06K 9/00 G10L 15/00
US Classification:
704236, 704231, 704249, 704255, 704256
Abstract:
An improved method and apparatus is disclosed which uses probabilistic techniques to map an input search string with a prestored audio file, and recognize certain portions of a search string phonetically. An improved interface is disclosed which permits users to input search strings, linguistics, phonetics, or a combination of both, and also allows logic functions to be specified by indicating how far separated specific phonemes are in time.
Feature Normalization For Speech And Audio Processing
Peter S. Cardillo - Atlanta GA, US Mark A. Clements - Lilburn GA, US
Assignee:
Nexidia Inc. - Atlanta GA
International Classification:
G10L 19/14
US Classification:
704224, 704E19001
Abstract:
Systems, method, and apparatus for processing a speech utterance or audio record that includes receiving one or more feature vectors characterizing the speech utterance or audio record, each feature vector having a plurality of feature elements, each feature element being associated with a spectral representation of a characteristic of one of a plurality of sequential segments of the speech utterance or audio record; and processing the one or more feature vectors in a rank order filter to obtain one or more normalized feature vectors, each normalized feature vector having a plurality of normalized feature elements corresponding to the plurality of feature elements.
Robert W. Morris - Atlanta GA, US Jon A. Arrowood - Smyrna GA, US Mark A. Clements - Lilburn GA, US Kenneth King Griggs - Roswell GA, US Peter S. Cardillo - Atlanta GA, US Marsal Gavalda - Sandy Springs GA, US
Assignee:
Nexidia Inc. - Atlanta GA
International Classification:
G10L 15/04 G06F 17/30
US Classification:
704251, 707E17039, 704E15001
Abstract:
In one aspect, a method for processing media includes accepting a query. One or more language patterns are identified that are similar to the query. A putative instance of the query is located in the media. The putative instance is associated with a corresponding location in the media. The media in a vicinity of the putative instance is compared to the identified language patterns and data characterizing the putative instance of the query is provided according to the comparing of the media to the language patterns, for example, as a score for the putative instance that is determined according to the comparing of the media to the language patterns.