Xuedong Huang, Alex Acero, Fil Alleva, Mei-Yuh Hwang, Li Jiang, and Milind Mahajan
Since January 1993, we have been working to refine and extend
Sphinx-II technologies in order to develop practical speech
recognition at Microsoft. The result of that work has been the
Whisper (Windows Highly Intelligent Speech Recognizer).
Whisper represents significantly improved recognition efficiency,
usability, and accuracy, when compared with the Sphinx-II
system. In addition Whisper offers speech input capabilities for
Microsoft Windows and can be scaled to meet different PC
platform configurations. It provides features such as continuous
speech recognition, speaker-independence, on-line adaptation,
noise robustness, dynamic vocabularies and grammars. For typical
Windows Command-and-Control applications (less than 1,000
words), Whisper provides a software only solution on PCs
equipped with a 486DX, 4MB of memory, and a standard sound
card and a desk-top microphone.
In Proc. of the International Conference on Acoustics, Speech, and Signal Processing
Publisher Institute of Electrical and Electronics Engineers, Inc.
© 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.