Voice Search: Say What You Want and Get It
In the Voice Search project, we envision a future where you can ask your cellphone for any kind of information and get it. On a small cellphone, traditional keyboard-based information entry carries a heavy tax, and we believe communicating by voice can be significantly more convenient. Our work focuses on making this communication more reliable and on covering the full range of information needed in daily life.
Publications
- Geoffrey Zweig and Shuangyu Chang, Personalizing Model M for Voice-search, in Interspeech, International Speech Communication Association, 2011
- Dong Yu, Li Deng, and George E. Dahl, Roles of Pre-Training and Fine-Tuning in Context-Dependent DBN-HMMs for Real-World Speech Recognition, in NIPS 2010 workshop on Deep Learning and Unsupervised Feature Learning, December 2010
- Jui-Ting Huang, Xiao Li, and Alex Acero, Discriminative Training Methods for Language Models Using Conditional Entropy Criteria, in ICASSP, IEEE, March 2010
- Shankar Shivappa, Patrick Nguyen, and Geoffrey Zweig, Discriminative Template Extraction for Direct Modeling, in ICASSP, IEEE, 2010
- Patrick Nguyen and Geoffrey Zweig, Speech Recognition with Flat Direct Models, in IEEE Journal of Selected Topics in Signal Processing, IEEE, 2010
- Geoffrey Zweig and Patrick Nguyen, From Flat Direct Models to Segmental CRF Models, in ICASSP, IEEE, 2010
- Dong Yu, Li Deng, and Alex Acero, Using continuous features in the maximum entropy model, in Pattern Recognition Letters, vol. 30, no. 8, pp. 1295-1300, Elsevier, October 2009
- Xiao Li, Patrick Nguyen, Geoffrey Zweig, and Dan Bohus, Leveraging Multiple Query Logs to Improve Language Models for Spoken Query Recognition, in ICASSP, IEEE, April 2009
- Daniel Bolanos, Geoffrey Zweig, and Patrick Nguyen, Multi-scale Personalization for Voice Search Applications, in HLT-NAACL 2009, Association for Computational Linguistics, 2009
- Georg Heigold, Geoffrey Zweig, Xiao Li, and Patrick Nguyen, A Flat Direct Model for Speech Recognition, in ICASSP-2009, IEEE, 2009
- Geoffrey Zweig and Patrick Nguyen, A Segmental CRF Approach to Large Vocabulary Continuous Speech Recognition, in ASRU, IEEE, 2009
- Dong Yu, Balakrishnan Varadarajan, Li Deng, and Alex Acero, Active Learning and Semi-supervised Learning for Speech Recognition: A Unified Framework using the Global Entropy Reduction Maximization Criterion, in Computer Speech and Language - Special Issue on Emergent Artificial Intelligence Approaches for Pattern Recognition in Speech and Language Processing, Elsevier, 2009
- Dan Bohus, Xiao Li, Patrick Nguyen, and Geoffrey Zweig, Learning N-Best Correction Models from Implicit User Feedback in a Multi-Modal Local Search Application, in Special Interest Group on Discourse and Dialogue (SIGdial), June 2008
- Xiao Li, Y.-C. Ju, Geoffrey Zweig, and Alex Acero, Language modeling for voice search: a machine translation approach, in ICASSP, March 2008
- Tim Paek and Yun-Cheng Ju, Accommodating Explicit User Expressions of Uncertainty in Voice Search or Something Like That, International Speech Communication Association, 2008
- G. Zweig, D. Bohus, X. Li, and P. Nguyen, Structured Models for Joint Decoding of Repeated Utterances, in Proceedings of Interspeech, 2008
- Z. Li, Geoffrey Zweig, and Patrick Nguyen, Optimal Dialog in Consumer-Rating Systems using a POMDP Framework, in Proceedings of SIGdial, 2008
- Milind Mahajan, Patrick Nguyen, and Geoffrey Zweig, Summarization of Multiple User Reviews in the Restaurant Domain, no. MSR-TR-2007-126, September 2007
- Dong Yu, Yun-Cheng Ju, Ye-Yi Wang, Geoffrey Zweig, and Alex Acero, Automated Directory Assistance System - from Theory to Practice, in Proc. of Interspeech, International Speech Communication Association, Antwerp, Belgium, August 2007
- Geoffrey Zweig, Patrick Nguyen, Yun-Cheng Ju, Ye-Yi Wang, Dong Yu, and Alex Acero, The Voice-Rate Dialog System for Consumer Ratings, in INTERSPEECH, International Speech Communication Association, Antwerp, Belgium, 2007
- Tim Paek, Yun-Cheng Ju, and Christopher Meek, People Watcher: A Game for Eliciting Human-Transcribed Data for Automated Directory Assistance, International Speech Communication Association, 2007
- Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Geoffrey Zweig, and Alex Acero, Confidence Measures for Voice Search Applications, in 8th Annual Conference of the International Speech Communication Association, International Speech Communication Association, Antwerp, Belgium, 2007
- EE Jan, B. Maison, L. Mangu, and G. Zweig, Automatic construction of Unique Signatures and Confusable sets for Natural Language Directory Assistance Application, in Proceedings of Eurospeech, 2003
