Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Information Retrieval Methods for Automatic Speech Recognition

Xiaoqiang Xiao, Jasha Droppo, and Alex Acero


In this paper, we use information retrieval (IR) techniques to improve a speech recognition (ASR) system. The potential benefits include improved speed, accuracy, and scalability. Where conventional HMM-based speech recognition systems decode words directly, our IR-based system first decodes subword units. These are then mapped to a target word by the IR system. In this decoupled system, the IR serves as a lightweight, data-driven pronunciation model. Our proposed method is evaluated in the Windows Live Search for Mobile (WLS4M) task, and our best system has 12% fewer errors than a comparable HMM classifier. We show that even using an inexpensive IR weighting scheme (TF-IDF) yields a 3% relative error rate reduction while maintaining all of the advantages of the IR approach.


Publication typeInproceedings
Published inICASSP
> Publications > Information Retrieval Methods for Automatic Speech Recognition