Alwin B. Carus - Newton MA; Michael Wiesner - West Roxbury MA; Deborah Krause - Burlington MA
Assignee:
Lernout & Hauspie Speech Products N.V. - Ieper
International Classification:
G06F 15/38
US Classification:
704/9
Abstract:
A word breaker utilizing a lexicon module and a processing module to identify word breaks in a stream of Asian (e.g., Japanese, Chinese, or Korean) language text. The lexicon module is a dictionary or database containing words native to the language of the input text. The processing module includes a plurality of analysis modules which operate on the input text. In particular, the processing module can include modules that analyze the input text using heuristic rules and statistical analysis to identify a first set of word breaks, thereby reducing the size of segments with undefined word breaks. The processing module also includes a database analysis module that identifies the remaining undefined word breaks in those smaller segments that have undergone heuristic or statistical analysis.
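The two-stage flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the toy lexicon, the punctuation-only heuristic, the function names, and the greedy longest-match lookup are all assumptions chosen for illustration, and the statistical analysis stage is omitted entirely.

```python
import re

# Hypothetical toy lexicon; a real lexicon module would hold far more entries.
LEXICON = {"東京", "大学", "東京大学", "学生", "です", "私", "は", "の"}

# Assumed heuristic rule: these punctuation marks always delimit words.
PUNCT = re.compile(r"([、。！？])")


def heuristic_split(text):
    """Stage 1: cut at punctuation, leaving smaller segments whose
    internal word breaks are still undefined."""
    return [s for s in PUNCT.split(text) if s]


def lexicon_split(segment, lexicon=LEXICON):
    """Stage 2: resolve remaining breaks by greedy longest-match lookup
    (a stand-in technique, not necessarily the one the patent uses)."""
    words, i = [], 0
    max_len = max(map(len, lexicon))
    while i < len(segment):
        # Try the longest candidate first, then shrink.
        for j in range(min(len(segment), i + max_len), i, -1):
            if segment[i:j] in lexicon:
                break
        else:
            j = i + 1  # unknown character becomes a one-character word
        words.append(segment[i:j])
        i = j
    return words


def break_words(text):
    """Run the heuristic stage, then the lexicon stage on each segment."""
    words = []
    for seg in heuristic_split(text):
        if PUNCT.fullmatch(seg):
            words.append(seg)
        else:
            words.extend(lexicon_split(seg))
    return words


print(break_words("私は東京大学の学生です。"))
```

Running the heuristic stage first keeps each lexicon lookup confined to a short segment, which is the size reduction the abstract describes.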
Dr. Krause graduated from the Kansas City University of Medicine and Biosciences College of Osteopathic Medicine in 2003. She works in Jefferson City, MO and specializes in Psychiatry. Dr. Krause is affiliated with Capital Region Medical Center.