I am Research Area Manager in Microsoft Research. I manage the Speech group directly and oversee the Natural Language Processing group and the Communication and Collaboration Systems group. People in these teams contributed to many Microsoft products including Xbox Kinect, Ford SYNC, Voice Search, Lync. I also have a team that runs Microsoft’s machine translation service.
Research interests
I have had longstanding interest in developing speech recognition systems that are robust to background noise. In my PhD thesis, I developed a model to express the cepstrum of noisy speech as a function of the cepstrum of clean speech which was the base for algorithms such as CDCN, VTS, and SPLICE. I also worked on a model called uncertainty decoding that estimates the clean speech cepstrum from the noisy speech as a distribution which is then integrated in HMM systems.
I have also worked on various acoustic modeling techniques, including one of the first implementations of VTLN, vocal tract length normalization, for my PhD dissertation, using a linear model on the cepstrum. I'm also interested in discriminative machine learning techniques for acoustic modeling such as conditional random fields and neural networks, and rapid adaptation.
I am interested in language understanding, machine translation, telepresence and multimodal systems.
Brief Bio
Alex Acero joined Microsoft Research in 1994, became manager of the speech grup in 2000 and since 2006 is currently a Research Area Manager directing an organization with over 50 researchers and engineers working on audio, speech, multimedia, communication, and natural language. Prior to joining Microsoft, he was the manager of the speech group at Telefonica Investigacion y Desarrollo (1992-1993) and a Senior Engineer at Apple Computer (1990-1991). He has 93 granted US patents.
Since 2000, Dr. Acero is also Affiliate Professor of Electrical Engineering at the University of Washington and has taught Spoken Language Processing. He has participated in the PhD thesis committee of 7 students.
Alex got his Ph.D. in EE from Carnegie Mellon University in 1990, his MS from Rice University in 1987 and a Telecommunications Engineering Degree from the Universidad Politecnica de Madrid in 1985, all in Electrical Engineering.
Honors:
- IEEE Fellow
- ISCA Fellow
- 2006 Signal Processing Society IEEE Distinguished Lecturer.
- Fulbright Scholarship
Activities in IEEE Signal Processing Society
Boards/Committees
- President-Elect (2012-2013), President (2014-2015) and Past-President (2016-2017).
- Director Industrial Relations (2010-2012). Also part of Membership Board (2010-2012).
- Vice President Technical Directions (2007-2009).
- Board of Governors: Member-at-large (2004-2005) and (2010-2012). Also member of the Long-Range Planning and Implementation Committee and TC Review Committee.
- Chair (2000-2002) and member (1996-2000) of the Speech Technical Committee.
- IEEE Spoken Language Processing Student Travel Grant (since 2004). Sponsorship (along with Drs. Huang and Hon) of this grant to the best ICASSP student papers in the speech area since 2004, using proceeds from their textbook Spoken Language Processing (Prentice Hall, 2001).
- Member of the IEEE Signal Processing Society since 1984.
Conferences
- Technical Co-Chair of IEEE ESPA 2012. Emerging Signal Processing Applications is a new conference devoted to practitioners.
- General Co-Chair of the 2001 IEEE Workshop on Automatic Speech Recognition and Understanding.
- Sponsorship Chair of the 1999 IEEE Workshop on Automatic Speech Recognition and Understanding.
- Publications Chair of ICASSP98. Built ICASSP’s first electronic submission website.
Journals
- Member of the editorial board of IEEE Signal Processing Magazine (2008-2010).
- Member Editorial Board for IEEE Journal of Selected Topics in Signal Processing (2006-2008).
- Associate Editor for IEEE Transactions on Audio, Speech and Language Processing (2005-2007).
- Associate Editor for IEEE Signal Processing Letters (2003-2005).
Other Service
- Sponsorship co-chair of Interspeech 2006.
-
Tutorials Chair at HLT 2004.
-
Member Editorial Board of Computer, Speech and Language. Elsevier (1993- 2009).
-
Member Editorial Board for Computer, Speech and Language (1994-present).
-
Tutorial on Spoken Language Processing, at ICSLP 2004.
-
Tutorial on Multimodal Language Processing, with M. Rahim, at ICASSP 2002.
-
Microsoft’s Diversity Leadership Council.
-
Microsoft’s Latin America executive sponsor.
Personal
Alex was born in Madrid, Spain. He's married to Donna and is the proud father of Nicolas and Marcos. He likes to play the piano, soccer and sip a good wine, though not at the same time ;-)
- Xuedong Huang, Alex Acero, and Hsiao-Wuen Hon, Spoken Language Processing, pp. 1008, Prentice-Hall, May 2001
- Alex Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition, pp. 212, Kluwer Academic , 1993
- Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, and Alex Acero, Voice Search, in Tur & DeMori Eds. Spoken Language Understanding, Wiley, 2011
- Yeyi Wang, L. Deng, and A. Acero, Semantic Frame Based Spoken Language Understanding, in Chapter 3, Tur and De Mori (eds) Spoken Language Understanding: Systems for Extracting Semantic Information from Speech, , pp. 35-80, Wiley, 2011
- Jasha Droppo and Alex Acero, Environmental Robustness, in Benesty, Sondhi, Huang (eds) Handbook of Speech Processing, Springer, 2008
- X. D. Huang, Alex Acero, F. Alleva, M. Hwang, L Jiang, and Milind Mahajan, From Sphinx-II to Whisper: Making Speech Recognition Usable, in Automatic Speech and Speaker Recognition, Advanced Topics, pp. 536, Kluwer Academic , 1996
- R. Stern, F. Liu, Y. Ohshima, and Alex Acero, Signal Processing for Robust Speech Recognition, in Automatic Speech and Speaker Recognition, Advanced Topics, Kluwer Academic , 1996
- Alex Acero, The Role of Phoneticians in Speech Technology, in European Studies in Phonetics and Speech Communication, European Language Resources Association, August 1995
2013
- Li Deng, Jinyu Li, Jui-Ting Huang, Kaisheng Yao, Dong Yu, Frank Seide, Michael Seltzer, Geoff Zweig, Xiaodong He, Jason Williams, Yifan Gong, and Alex Acero, Recent Advances in Deep Learning for Speech Research at Microsoft, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013
2012
- George Dahl, Dong Yu, Li Deng, and Alex Acero, Context-Dependent Pre-trained Deep Neural Networks for Large Vocabulary Speech Recognition, in IEEE Transactions on Audio, Speech, and Language Processing, Special Issue on Deep Learning for Speech and Langauge Processing, vol. 20, no. 1, pp. 30-42, January 2012
2011
- Xiaodong He, Amittai Axelrod, Li Deng, Alex Acero, Mei-Yuh Hwang, Alisa Nguyen, Andrew Wang, and Xiahui Huang, THE MSR SYSTEM FOR IWSLT 2011 EVALUATION, Internaltional Workshop on Spoken Language Translation (IWSLT), December 2011
2010
- Xiao Li, Ye-Yi Wang, Dou Shen, and Alex Acero, Learning with Click Graph for Query Intent Classification, in ACM Transaction on Information Systems, vol. 28, no. 3, Association for Computing Machinery, Inc., June 2010
2009
- Dong Yu, Li Deng, and Alex Acero, Using continuous features in the maximum entropy model, in Pattern Recognition Letters, vol. 30, no. 8, pp. 1295-1300, Elsevier , October 2009
- Dong Yu, Li Deng, Yifan Gong, and Alex Acero, A Novel Framework and Training Algorithm for Variable-Parameter Hidden Markov Models, in IEEE Transactions on Audio, Speech and Language Processing, vol. 17, no. 7, pp. 1348-1360, IEEE, September 2009
- Dong Yu, Balakrishnan Varadarajan, Li Deng, and Alex Acero, Active Learning and Semi-supervised Learning for Speech Recognition: A Unified Framework using the Global Entropy Reduction Maximization Criterion, in Computer Speech and Language - Special Issue on Emergent Artificial Intelligence Approaches for Pattern Recognition in Speech and Language Processing , Elsevier , 2009
- Jinyu Li, Dong Yu, Li Deng, Yifan Gong, and Alex Acero, A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions, in Computer Speech and Language, vol. 23, pp. 389-405, Elsevier , 2009
2008
- Dong Yu, Li Deng, Xiaodong He, and Alex Acero, Large-Margin Minimum Classification Error Training: A Theoretical Risk Minimization Perspective, in Computer Speech and Language, vol. 22, no. 4, pp. 415-429, Elsevier , October 2008
- Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, and Alex Acero, Robust speech recognition using cepstral minimum-mean-square-error noise suppressor, in IEEE Trans. Audio, Speech, and Language Processing, vol. 16, no. 5, Institute of Electrical and Electronics Engineers, Inc., July 2008
- Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, and Alex Acero, An Introduction to Voice Search, in IEEE Signal Processing Magazine (Special Issue on Spoken Language Technology), Institute of Electrical and Electronics Engineers, Inc., May 2008
- Amarnag Subramanya, Zhengyou Zhang, Zicheng Liu, and Alex Acero, Multisensory processing for speech enhancement and magnitude-normalized spectra for speech modeling, in Speech Communication, vol. 50, pp. 228-243, Elsevier , March 2008
- Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, and Alex Acero, An integrative and discriminative technique for spoken utterance classification, in IEEE Trans. Audio, Speech, and Language Processing, vol. 16, no. 6, pp. 1207-1214, Institute of Electrical and Electronics Engineers, Inc., 2008
2007
- Ciprian Chelba, Jorge Silva, and Alex Acero, Soft Indexing of Speech Content for Search in Spoken Documents, in Computer Speech & Language, vol. 21, no. 3, pp. 423-578, Elsevier , July 2007
- Amarnag Subramanya, Michael Seltzer, and Alex Acero, Automatic Removal of Typed Keystrokes From Speech Signals, in IEEE Signal Processing Letters, vol. 14, no. 5, pp. 363-366, Institute of Electrical and Electronics Engineers, Inc., May 2007
- Li Deng, Hagai Attias, Leo Lee, and Alex Acero, Adaptive Kalman smoothing for tracking vocal tract resonances using a continuous-valued hidden dynamic model, in IEEE Transactions on audio, Speech and Language Processing, vol. 15, no. 1, pp. 13-23, Institute of Electrical and Electronics Engineers, Inc., January 2007
- Dong Yu, Li Deng, and Alex Acero, Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation, in Computer Speech and Language, vol. 27, pp. 72-87, Elsevier , 2007
- Michael Seltzer and Alex Acero, Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition, in Trans. on Audio, Speech and Language Processing, vol. 15, no. 1, pp. 235-245, Institute of Electrical and Electronics Engineers, Inc., January 2007
2006
- Ciprian Chelba and Alex Acero, Adaptation of maximum entropy capitalizer: Little data can help a lot, in Computer Speech & Language, vol. 20, no. 4, pp. 382-399, Elsevier , October 2006
- Li Deng, Dong Yu, and Alex Acero, Structured Speech Modeling, in IEEE Trans. on Audio, Speech and Language Processing, vol. 14, no. 5, pp. 1492-1504, Institute of Electrical and Electronics Engineers, Inc., September 2006
- Dong Yu, Li Deng, and Alex Acero, A Lattice Search Technique for a Long-Contextual-Span Hidden Trajectory Model of Speech, in Speech Communication, Elsevier , September 2006
- I. Bazzi, Li Deng, and Alex Acero, Tracking Vocal Tract Resonances Using a Quantized Nonlinear Function Embedded in a Temporal Constraint, in IEEE Trans. on Audio, Speech and Language Processing, vol. 14, no. 2, pp. 425-434, March 2006
- Alex Acero, Building Voice User Interfaces, in MSDN Magazine, February 2006
- Ye-Yi Wang and Alex Acero, Rapid development of spoken language understanding grammars, in Speech Communication, vol. 48, no. 3-4, pp. 390-416, Elsevier , 2006
- Li Deng, Dong Yu, and Alex Acero, A Bidirectional Target Filtering Model of Speech Coarticulation: two-stage Implementation for Phonetic Recognition, in IEEE Transactions on Audio and Speech Processing, vol. 14, no. 1, pp. 256-265, IEEE, January 2006
2005
- Dong Yu and Alex Acero, Semiautomatic Improvements of System-Initiative Spoken Dialog Applications Using Interactive Clustering, in IEEE Trans. Speech & Audio Proc (Special Issue on Data Mining of Speech, Audio and Dialog), IEEE, September 2005
- Li Deng, J. Wu, Jasha Droppo, and Alex Acero, Analysis and Comparison of Two Speech Feature Extraction/Compensation Algorithms, in IEEE Signal Processing Letters, vol. 12, no. 6, pp. 477–480, Institute of Electrical and Electronics Engineers, Inc., June 2005
- Li Deng, Jian Wu, Jasha Droppo, and Alex Acero, Dynamic Compensation of HMM Variances Using the Feature Enhancement Uncertainty Computed From a Parametric Model of Speech Distortion, in IEEE Transactions on Speech and Audio Processing, vol. 13, no. 3, pp. 412–421, Institute of Electrical and Electronics Engineers, Inc., May 2005
- Ye-Yi Wang, Li Deng, and Alex Acero, Spoken Language Understanding — An Introduction to the Statistical Framework, in IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 16-31, Institute of Electrical and Electronics Engineers, Inc., 2005
2004
- Li Deng, Jasha Droppo, and Alex Acero, Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features, in IEEE Transactions on Speech and Audio Processing, vol. 12, no. 3, pp. 218–233, Institute of Electrical and Electronics Engineers, Inc., May 2004
- Li Deng, Jasha Droppo, and Alex Acero, Enhancement of log Mel power spectra of speech using a phase-sensitive model of the acoustic environment and sequential estimation of the corrupting noise, in IEEE Transactions on Speech and Audio Processing, vol. 12, no. 2, pp. 133–143, Institute of Electrical and Electronics Engineers, Inc., March 2004
- Li Deng, Ye-Yi Wang, Kuansan Wang, Alex Acero, Hsiao Hon, Jasha Droppo, C. Boulis, Derek Jacoby, Milind Mahajan, Ciprian Chelba, and Xuedong Huang, Speech and language processing for multimodal human-computer interaction (Invited Article) , in Journal of VLSI Signal Processing Systems (Special issue on Real-World Speech Processing), vol. 36, no. 2-3, pp. 161 - 187, Kluwer Academic , 2004
2003
- Li Deng, Jasha Droppo, and Alex Acero, Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition, in IEEE Transactions on Speech and Audio Processing, vol. 11, no. 6, pp. 568–580, Institute of Electrical and Electronics Engineers, Inc., November 2003
2002
- Li Deng, Kuansan Wang, Alex Acero, Hsiao-Wuen Hon, Jasha Droppo, Constantinos Boulis, Ye-Yi Wang, Derek Jacoby, Milind Mahajan, Ciprian Chelba, and Xuedong D. Huang, Distributed Speech Processing in MiPad’s Multimodal User Interface, in IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, vol. 10, no. 8, pp. 605-619, Institute of Electrical and Electronics Engineers, Inc., 2002
2000
- Y. Rui, A. Gupta, and Alex Acero, Automatically Extracting Highlights for TV Baseball Programs, in ACM Multimedia, pp. 105-115, 2000
2012
- Amittai Axelrod, Xiaodong He, Li Deng, Alex Acero, and Mei-Yuh Hwang, New Methods and Evaluation Experiments on Translating TED Talks in the IWSLT Benchmark, IEEE International Confrence on Acoustics, Speech, and Signal Processing (ICASSP), March 2012
2011
- Mike Seltzer and Alex Acero, Separating Speaker and Environmental Variability Using Factored Transforms, in Interspeech, International Speech Communication Association, August 2011
- Hoang Do, Ivan Tashev, and Alex Acero, A New Speaker Identification Algorithm for Gaming Scenarios, in ICASSP, IEEE, May 2011
- G. Dahl, Dong Yu, Li Deng, and Alex Acero, Large Vocabulary Continuous Speech Recognition With Context-Dependent DBN-HMMS, in Proc. ICASSP, Prague, IEEE, May 2011
- Xing Fan, Michael Seltzer, Jasha Droppo, Henrique Malvar, and Alex Acero, Joint Encoding of the Waveform and Speech Recognition Features Using a Transform Codec, in International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., May 2011
- Jingjing Liu, Xiao Li, Alex Acero, and Ye-Yi Wang, Lexicon Modeling for Query Understanding, in ICASSP, IEEE, May 2011
- Xiaodong He, Li Deng, and Alex Acero, Why Word Error Rate is not a Good Metric for Speech Recognizer Training for the Speech Translation Task?, in Proc. ICASSP, IEEE, May 2011
- Yaodong Zhang, Li Deng, Xiaodong He, and Alex Acero, A Novel Decision Function and the Associated Decision-Feedback Learning for Speech Translation, in ICASSP, IEEE, May 2011
2010
- Mike Seltzer and Alex Acero, HMM Adaptation Using Linear Spline Interpolation with Integrated Spline Parameter Training for Robust Speech Recognition, in Interspeech, International Speech Communication Association, September 2010
- Geoffrey Zweig, Patrick Nguyen, Jasha Droppo, and Alex Acero, Continuous Speech Recognition with a TF-IDF Acoustic Model, International Speech Communication Association, September 2010
- Li Deng, Mike Seltzer, Dong Yu, Alex Acero, Abdel-rahman Mohamed, and Geoff Hinton, Binary Coding of Speech Spectrograms Using a Deep Auto-encoder, in Interspeech 2010, International Speech Communication Association, September 2010
- Ivan Tashev and Alex Acero, Statistical Modeling of the Speech Signal, in International Workshop on Acoustic, Echo, and Noise Control (IWAENC), Tel Aviv, Israel, 1 September 2010
- Ivan Tashev, Andrew Lovitt, and Alex Acero, Dual stage probabilistic voice activity detector, in NOISE-CON 2010 and 159th Meeting of the Acoustical Society of America, Acoustical Society of America, 20 April 2010
- Lae-Hoon Kim, Ivan Tashev, and Alex Acero, Reverberated Speech Signal Separation Based on Regularized Subband Feedforward ICA and Instantaneous Direction of Arrival, in International Conference on Acoustics, Speech and Signal Processing, IEEE, 16 March 2010
- Jui-Ting Huang, Xiao Li, and Alex Acero, Discriminative Training Methods for Language Models Using Conditional Entropy Criteria, in ICASSP, IEEE, March 2010
- Jasha Droppo and Alex Acero, Context Dependent Phonetic String Edit Distance for Automatic Speech Recognition, in ICASSP, IEEE, March 2010
- Mike Seltzer, Alex Acero, and Kaustubh Kalgaonkar, Acoustic Model Adaptation via Linear Spline Interpolation for Robust Speech Recognition, in ICASSP, IEEE, March 2010
- Xiaoqiang Xiao, Jasha Droppo, and Alex Acero, Information Retrieval Methods for Automatic Speech Recognition, in ICASSP, IEEE, March 2010
2009
- Ivan Tashev, Michael Seltzer, Yun-Cheng Ju, Ye-Yi Wang, and Alex Acero, Commute UX: Voice Enabled In-car Infotainment System, in Mobile HCI '09: Workshop on Speech in Mobile and Pervasive Environments (SiMPE), Association for Computing Machinery, Inc., Bonn, Germany, 15 September 2009
- Dong Yu, Li Deng, and Alex Acero, Hidden Conditional Random Field with Distribution Constraints for Phone Classification, in Interspeech 2009, International Speech Communication Association, September 2009
- Ivan Tashev, Andrew Lovitt, and Alex Acero, Unified Framework for Single Channel Speech Enhancement, in 2009 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, IEEE, Victoria B.C., Canada, 24 August 2009
- Xiao Li, Ye-Yi Wang, and Alex Acero, Extracting Structured Information from User Queries with Semi-Supervised Conditional Random Fields, in SIGIR, July 2009
- Ozlem Kalinli, Michael L. Seltzer, and Alex Acero, Noise Adaptive Training Using a Vector Taylor Series Approach for Robust Automatic Speech Recognition, in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Taipei, Taiwan, April 2009
- Balakrishnan Varadarajan, Dong Yu, Li Deng, and Alex Acero, Using collective information in semi-supervised learning for speech recognition, in Proceedings of the ICASSP, Institute of Electrical and Electronics Engineers, Inc., April 2009
- Dong Yu, Li Deng, Peng Liu, Jian Wu, Yifan Gong, and Alex Acero, Cross-lingual speech recognition under run-time resource constraints, in Proceedings of the ICASSP, Institute of Electrical and Electronics Engineers, Inc., April 2009
- Young-In Song, Ye-Yi Wang, Yun-Cheng Ju, Mike Seltzer, Ivan Tashev, and Alex Acero, Voice Search of Structured Media Data, in International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electornic Engineers, Inc., Taipei, Taiwan, April 2009
- Balakrishnan Varadarajan, Dong Yu, Li Deng, and Alex Acero, Maximizing global entry reduction for active learning in speech recognition, in Proceedings of the ICASSP, Institute of Electrical and Electronics Engineers, Inc., April 2009
- Jasha Droppo and Alex Acero, Experimenting with a Global Decision Tree for State Clustering in Automatic Speech Recognition Systems, in ICASSP 2009, IEEE, April 2009
- Oriol Vinyals, Li Deng, Dong Yu, and Alex Acero, Discriminative pronunciation learning using phonetic decoder and minimum classification error criterion, in Proceedings of the ICASSP, Institute of Electrical and Electronics Engineers, Inc., April 2009
- Hui Lin, Li Deng, Dong Yu, Yifan Gong, Alex Acero, and Chi-Hui Lee, A Study on Multilingual Acoustic Modeling For Large Vocabulary ASR, in Proceedings of the ICASSP, Institute of Electrical and Electronics Engineers, Inc., April 2009
2008
- Dong Yu, Li Deng, Jian Wu, Yifan Gong, and Alex Acero, Improvements on Mel-Frequency Cepstrum Minimum-Mean-Square-Error Noise Suppressor for Robust Speech Recognition, in ISCSLP, IEEE, December 2008
- Hui Lin, Li Deng, Jasha Droppo, Dong Yu, and Alex Acero, Learning Methods in Multilingual Speech Recognition, in NIPS Workshop, Whistler, BC, Canada, Microsoft, December 2008
- Dong Yu, Li Deng, and Alex Acero, The Maximum Entropy Model with Continuous Features , in NIPS Workshop, Whistler, BC, Canada, Microsoft, December 2008
- Xiaolong Li, Li Deng, Yun-Cheng Ju, and Alex Acero, Automatic Children's Reading Tutor on Hand-Held Devices, in Proceedings of Interspeech, International Speech Communication Association, Brisbane, Australia, September 2008
- Jasha Droppo, Michael L. Seltzer, Alex Acero, and Y.-H. Chiu, Towards a non-parametric acoustic model: an acoustic decision tree for observation probability calculation, in Proceedings of Interspeech, International Speech Communication Association, Brisbane, Australia, September 2008
- Ivan Tashev, Slavy Mihov, Tyler Gleghorn, and Alex Acero, Sound Capture System and Spatial Filter for Small Devices, in Proceedings of Interspeech 2008, International Speech Communication Association, Brisbane, Australia, September 2008
- Ye-Yi Wang, Xiao Li, and Alex Acero, Inductive and Example-Based Learning for Text Classification, in Interspeech, International Speech Communication Association, Brisbane, Australia, September 2008
- Dong Yu, Li Deng, Yifan Gong, and Alex Acero, Parameter Clustering and Sharing in Variable-Parameter HMMs for Noise Robust Speech Recognition, in Proc. of the Interspeech, International Speech Communication Association, September 2008
- Dong Yu, Li Deng, Yifan Gong, and Alex Acero, Discriminative Training of Variable-Parameter HMMs for Noise Robust Speech Recognition, in Proceedings of the Interspeech, International Speech Communication Association, September 2008
- Xiao Li, Ye-Yi Wang, and Alex Acero, Learning Query Intent from Regularized Click Graphs, in SIGIR'08: the 31st Annual ACM SIGIR conference on Research and Development in Information Retrieval, Association for Computing Machinery, Inc., Singapore, Singapore, July 2008
- Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, and Alex Acero, A Minimum Mean-Square-Error Noise Reduction Algorithm on Mel-Frequency Cepstra for Robust Speech Recognition, in Proc. ICASSP, Institute of Electrical and Electronics Engineers, Inc., April 2008
- Luis Buera, Jasha Droppo, and Alex Acero, Speech Enhancement using a Pitch Predictive Model, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 2008
- Nilesh Madhu, Ivan Tashev, and Alex Acero, An EM-based Probabilistic Approach for Acoustic Echo Suppression, in Proceedings of International Conference on Audio, Speech and Signal Processing ICASSP 2008, Institute of Electrical and Electronics Engineers, Inc., Institute of Electrical and Electronics Engineers, Inc., Las Vegas, USA, April 2008
- Ivan Tashev, Jasha Droppo, Michael Seltzer, and Alex Acero, Robust Design of Wideband Loudspeaker Arrays, in Proc. of International Conference on Audio, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Las Vegas, USA, April 2008
- Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan Gong, and Alex Acero, Adaptation of compressed HMM parameters for resource-constrained speech recognition, Institute of Electrical and Electronics Engineers, Inc., April 2008
- Graham Taylor, Michael Seltzer, and Alex Acero, Maximum a Posteriori ICA: Applying Prior Knowledge to the Separation of Acoustic Sources, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 2008
- Alex Acero, Neal Bernstein, Rob Chambers, Yun-Cheng Ju, Xiao Li, Julian Odell, Patrick Nguyen, Oliver Scholtz, and Geoff Zweig, Live Search for Mobile: Web Services by Voice on the Cellphone, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 2008
- Jinyu Li, Li Deng, Dong Yu, Yifan Gong, and Alex Acero, HMM Adaptation Using a Phase-Sensitive Acoustic Distortion Model for Environment-Robust Speech Recognition, Institute of Electrical and Electronics Engineers, Inc., April 2008
- Xiao Li, Y.-C. Ju, Geoffrey Zweig, and Alex Acero, Language modeling for voice search: a machine translation approach, in ICASSP, March 2008
- Jinyu Li, Li Deng, Dong Yu, Yifan Gong, and Alex Acero, HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition, in Proc. ICASSP, 2008
2007
- Xiao Li, Asela Guanawardana, and Alex Acero, Adapting grapheme-to-phoneme conversion for name recognition, in IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., December 2007
- Ivan Tashev, Michael Seltzer, Y. C. Ju, Dong Yu, and Alex Acero, Commute UX: Telephone Dialog System for Location-based Services, in Proceedings of SIGdial Workshop on Disclosure and Dialogue 2007, Antwerp, Belgium, September 2007
- Dong Yu, Yun-Cheng Ju, Ye-Yi Wang, Geoffrey Zweig, and Alex Acero, Automated Directory Assistance System - from Theory to Practice, in Proc. of Interspeech, International Speech Communication Association, Antwerp, Belgium, August 2007
- Michael Seltzer, Y. C. Ju, Ivan Tashev, and Alex Acero, Robust Location Understanding in Spoken Dialog Systems Using Intersections, in Proceedings of Interspeech 2007, Antwerp, Belgium, August 2007
- Jasha Droppo and Alex Acero, A Fine Pitch Model for Speech, in Proc. Interspeech Conference, International Speech Communication Association, August 2007
- J. Sherwani, Dong Yu, Tim Paek, Mary Czerwinski, Yun-Cheng Ju, and Alex Acero, VoicePedia: Towards Speech-based Access to Unstructured Information, in Interspeech, International Speech Communication Association, August 2007
- Amarnag Subramanya, Mike Seltzer, and Alex Acero, Removal of Typed Keystrokes from Speech Signals, in Proc. of the Interspeech Conference, International Speech Communication Association, May 2007
- Xiaolong Li, Yun-Cheng Ju, Li Deng, and Alex Acero, Efficient and Robust Language Modeling in an Automatic Children's Reading Tutor System, in Proceedings of IEEE Internaltional Conference on Acoustics, Speech and Signal Processing (ICASSP), Institute of Electrical and Electronics Engineers, Inc., 18 April 2007
- Michael Seltzer, Ivan Tashev, and Alex Acero, Microphone Array Post-Filter Using Incremental Bayes Learning to Track the Spatial Distribution of Speech and Noise, in Proceedings of International Conference on Audio, Speech and Signal Processing ICASSP 2007, Honolulu, USA, April 2007
- Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, and Alex Acero, A Discriminative Training Framework using N-Best Speech Recognition Transcriptions and Scores for Spoken Utterance Classification, in Proc. of the International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Honolulu, Hawaii, U.S.A., April 2007
- Byung-Jun Yoon, Ivan Tashev, and Alex Acero, Robust Adaptive Beamforming Algorithm Using Instantaneous Direction of Arrival with Enhanced Noise Suppression Capability, in Proceedings of International Conference on Audio, Speech and Signal Processing ICASSP 2007, Honolulu, USA, April 2007
- Amarnag Subramanya, Zhengyou Zhang, A.C. Surendran, Patrick Nguyen, Mukund Narasimhan, and Alex Acero, A Generative Discriminative Framework Using Ensemble Methods for Text-Dependent Speaker Verification, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 2007
- Chris White, Jasha Droppo, Alex Acero, and Julian Odell, Maximum Entropy Confidence Estimation for Speech Recognition, in Proc. ICASSP, Institute of Electrical and Electronics Engineers, Inc., Hawaii, April 2007
- Jinyu Li, Li Deng, Dong Yu, Yifan Gong, and Alex Acero, High-Performance HMM Adaptation With Joint Compensation of Additive and Convolutive Distortions Via Vector Taylor Series, in Proceedings IEEE Workshop on ASRU, Institute of Electrical and Electronics Engineers, Inc., April 2007
- Dong Yu, Li Deng, Xiaodong, and Alex Acero, Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks, in Proceedings of the ICASSP, Honolulu, Hawaii, IEEE, April 2007
- Ye-Yi Wang, Dong Yu, Yu-Cheng Ju, Geoffrey Zweig, and Alex Acero, Confidence Measures for Voice Search Applications, in 8th Annual Conference of the International Speech Communication Association, International Speech Communication Association, Antwerp, Belgium, 2007
- Jinyu Li, Li Deng, Dong Yu, Yifan Gong, and Alex Acero, High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series, in Proc. IEEE Automatic Speech Recognition and Understanding, 2007
- Geoffrey Zweig, Yun-Cheng Ju, Patrick Nguyen, Dong Yu, Ye-Yi Wang, and Alex Acero, Voice-Rate: A Dialog System for Consumer Ratings, in NAACL/HLT (Demonstration Program), Association for Computational Linguistics, Rochester, New York, USA, 2007
- Geoffrey Zweig, Patrick Nguyen, Yun-Cheng Ju, Ye-Yi Wang, Dong Yu, and Alex Acero, The Voice-Rate Dialog System for Consumer Ratings, in INTERSPEECH, International Speech Communication Association, Antwerp, Belgium, 2007
- Ye-Yi Wang and Alex Acero, Maximum Entropy Model Parameterization with Tf-Idf Weighted Vector Space Model, in IEEE Automatic Speech Recognition and Understanding Workshop, Institute of Electrical and Electronics Engineers, Inc., Kyoto, Japan, 2007
2006
- Xiaolong Li, Li Deng, Dong Yu, and Alex Acero, A Time-Synchronous Phonetic Decoder For A Long-Contextual-Span Hidden Trajectory Model, in Proceedings of International Conference on Speech Communication (InterSpeech), 2006, International Speech Communication Association, Pittsburgh, PA, 19 September 2006
- Amarnag Subramanya, Michael Seltzer, and Alex Acero, Removal of Typed Keystrokes from Speech Signals, in Proc. of the Interspeech Conference, International Speech Communication Association, September 2006
- Dong Yu, Li Deng, Xiaodong He, and Alex Acero, Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition, in Proc. of the Interspeech Conference, International Speech Communication Association, September 2006
- Dong Yu, Yun-Cheng Ju, and Alex Acero, An Effective and Efficient Utterance Verification Technology Using Word N-gram Filler Models, in Proc. of the Interspeech Conference, International Speech Communication Association, September 2006
- Ivan Tashev and Alex Acero, Microphone Array Post-Processor Using Instantaneous Direction of Arrival, in Proceedings of International Workshop on Acoustic, Echo and Noise Control IWAENC 2006, Paris, France, September 2006
- Jasha Droppo and Alex Acero, Joint Discriminative Front End and Back End Training for Improved Speech Recognition Accuracy, in Proc. ICASSP, Institute of Electrical and Electronics Engineers, Inc., Toulouse, France, May 2006
- J. Silva, C. Chelba, and Alex Acero, Pruning Analysis for the Position Specific Posterior Lattices for Spoken Document Search, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, May 2006
- Milind Mahajan, Asela Gunawardana, and Alex Acero, Training algorithms for hidden conditional random fields, in International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., May 2006
- Ivan Tashev, Jasha Droppo, and Alex Acero, Suppression Rule for Speech Recognition Friendly Noise Suppressors, in Proceedings of Eight International Conference Digital Signal Processing and Applications DSPA’06, Moscow, Russia, March 2006
- Ye-Yi Wang, John Lee, Milind Mahajan, and Alex Acero, Combining Statistical and Knowledge-Based Spoken Language Understanding in Conditional Models, in COLING/ACL06, Association for Computational Linguistics, Sydney, Australia, 2006
- Dong Yu, Yun-Cheng Ju, Ye-Yi Wang, and Alex Acero, N-Gram Based Filler Model for Robust Grammar Authoring, in International Conference on Acoustics, Speech, and Signal Processing., Institute of Electrical and Electronics Engineers, Inc., Toulouse, France, 2006
- Yun-Cheng Ju, Ye-Yi Wang, and Alex Acero, Call Analysis with Classification Using Speech and Non-Speech Features, in the International Conference on Spoken Language Processing, International Speech Communication Association, Pittsburgh, PA, USA, 2006
- Ye-Yi Wang and Alex Acero, Discriminative Models for Spoken Language Understanding., in the International Conference on Spoken Language Processing, International Speech Communication Association, Pittsburgh, PA, USA, 2006
- Ye-Yi Wang, John Lee, and Alex Acero, Speech Utterance Classification Model Training without Manual Transcriptions, in IEEE International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Roulouse, France, 2006
2005
- Mike Seltzer and Alex Acero, An EM Algorithm for Training Wideband Acoustic Models from Mixed-Bandwidth Training Data , in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., December 2005
- Jasha Droppo, Milind Mahajan, Asela Gunawardana, and Alex Acero, How to Train a Discriminative Front End with Stochastic Gradient Descent and Maximum Mutual Information, in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., Puerto Rico, December 2005
- Li Deng, Dong Yu, and Alex Acero, A Generative Modeling Framework for Structured Hidden Speech Dynamics, in NIPS Workshop on Advances in Structured Learning for Text and Speech Processing , Microsoft, December 2005
- Li Deng, Dong Yu, Xiaolong Li, and Alex Acero, A Long-Contextual-Span Model of Resonance Dynamics for Speech Recognition: Parameter Learning and Recognizer Evaluation, in Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., Puerto Rico, November 2005
- Zicheng Liu, Michael Seltzer, Alex Acero, Ivan Tashev, Zhengyou Zhang, and Mike Sinclair, A Compact Multi-Sensor Headset for Hands-Free Communication, in Proceedings of Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, USA, October 2005
- Li Deng, Xiaolong Li, Dong Yu, and Alex Acero, Evaluation of a Long-Contextual-Span Hidden Trajectory Model and Phonetic Recognizer Using A* Lattice Search, in Proc. of the Interspeech Conference, International Speech Communication Association, September 2005
- Ivan Tashev, Michael Seltzer, and Alex Acero, Microphone Array for Headset with Spatial Noise Suppressor, in Proceedings of Ninth International Workshop on Acoustic, Echo and Noise Control IWAENC 2005, Eindhoven, The Netherlands, September 2005
- A. Subramanya, Z. Zhang, Z. Liu, Jasha Droppo, and Alex Acero, A Graphical Model for Multi-Sensory Speech Processing in Air-and-Bone Conductive Microphones, in Proc. of the Interspeech Conference, International Speech Communication Association, Lisbon, Portugal, September 2005
- C. Chelba and Alex Acero, Indexing Uncertainty for Spoken Document Search, in Proc. of the Interspeech Conference, September 2005
- Asela Gunawardana, Milind Mahajan, Alex Acero, and John C. Platt, Hidden Conditional Random Fields for Phone Classification, in International Conference on Speech Communication and Technology, International Speech Communication Association, September 2005
- Jasha Droppo and Alex Acero, Maximum Mutual Information SPLICE Transform for Seen and Unseen Conditions, in Proc. Interspeech Conference, International Speech Communication Association, September 2005
- Li Deng, Dong Yu, and Alex Acero, Learning Statistically Characterized Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation and Reduction, in Proc. of the Interspeech Conference, International Speech Communication Association, September 2005
- Xiao Li, Asela Gunawardana, and Alex Acero, Unsupervised semantic intent discovery from call log acoustics, in International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., August 2005
- C. Chelba and Alex Acero, Position Specific Posterior Lattices for Indexing Speech, in Proc. of the Association for Computational Linguistics, June 2005
- C. Chelba and Alex Acero, SPEECH OGLE: Indexing Uncertainty for Spoken Document Search, in Proc. of the Association for Computational Linguistics, June 2005
- Li Deng, Xiang Li, Dong Yu, and Alex Acero, A Hidden Trajectory Model with Bi-Directional Target Filtering: Cascaded vs. Integrated Implementation for Phonetic Recognition, in Proc. of Int. Conf. on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., March 2005
- Z. Liu, A. Subramanya, Z. Zhang, Jasha Droppo, and Alex Acero, Leakage Model and Teeth Clack Removal for Air- and Bone-Conductive Integrated Microphones, in Proc. ICASSP, Institute of Electrical and Electronics Engineers, Inc., Philadelphia, March 2005
- Michael Seltzer and Alex Acero, Training Wideband Acoustic Models using Mixed-Bandwidth Training Data via Feature Bandwidth Extension, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., March 2005
- Dong Yu, Milind Mahajan, P. Mau, and Alex Acero, Maximum Entropy Based Generic Filter for Language Model Adaptation, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, IEEE, March 2005
- Ye-Yi Wang, John Lee, Milind Mahajan, and Alex Acero, Statistical Spoken Language Understanding: from Generative Model to Conditional Model, in NIPS Workshop: Advances in Structured Learning for Text and Speech Processing, Whistler, BC, Canada, 2005
- Ye-Yi Wang and Alex Acero, SGStudio: Rapid Semantic Grammar Development for Spoken Language Understanding, in 9th European Conference on Speech Communication and Technology, International Speech Communication Association, Lisbon, Portugal, 2005
- M. L. Seltzer, Alex Acero, and Jasha Droppo, Robust Bandwidth Extension of Noise-corrupted Narrowband Speech, in Proc. Interspeech Conference, International Speech Communication Association, 2005
2004
- Li Deng, Xiaolong Li, Dong Yu, and Alex Acero, Novel Acoustic Modeling with Structured Hidden Dynamics for Speech Coarticulation and Reduction, in Proc. of the DARPA RT04 Workshop, November 2004
- Dong Yu, Mei-Yuh Hwang, Peter Mau, Alex Acero, and Li Deng, Unsupervised Learning from Users’ Error Correction in Speech Dictation, in Proc. Int. Conf. on Spoken Language Processing, International Speech Communication Association, October 2004
- Li Deng, Dong Yu, and Alex Acero, A Quantitative Model for Formant Dynamics and Contextually Assimilated Reduction in Fluent Speech, in Proc. Int. Conf. on Spoken Language Processing, International Speech Communication Association, October 2004
- Zicheng Liu, Zhengyou Zhang, Alex Acero, Jasha Droppo, and Xuedong Huang, Direct Filtering for Air- and Bone-Conductive Microphones, in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., Siena, Italy, September 2004
- Li Deng, Zicheng Liu, Zhengyou Zhang, and Alex Acero, Nonlinear Information Fusion in Multi-Sensor Processing - Extracting and Exploiting Hidden Dynamics of Speech Captured by a Bone-Conductive Microphone, in Proc. of the IEEE Workshop on Multimedia Signal Processing, Institute of Electrical and Electronics Engineers, Inc., September 2004
- C. Chelba and Alex Acero, Adaptation of Maximum Entropy Capitalizer: Little Data Can Help a Lot,, in Proc. of EMNLP, July 2004
- Jasha Droppo and Alex Acero, Noise Robust Speech Recognition with a Switching Linear Dynamic Model, in Proc. ICASSP, IEEE, Montreal, Canada, May 2004
- Li Deng, L. Lee, H. Attias, and Alex Acero, A Structured Speech Model with Continuous Hidden Dynamics and Prediction-Residual Training for Tracking Vocal Tract Resonances, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, May 2004
- C. Chelba and Alex Acero, Conditional ML Estimation Using Rational Function Growth Transform, in Proc. of the Snowbird Learning Workshop, April 2004
- Zhengyou Zhang, Z. Liu, M. Sinclair, A. Acero, Li Deng, J. Droppo, Xuedong Huang, and Yanli Zheng, Multisensory microphones for robust speech detection, enhancement, and recognition, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, Canada, May 2004, IEEE, 2004
- Alex Acero, Ye-Yi Wang, and Kuansan Wang, A Semantically Structured Language Model, in Special Workshop in Maui, Maui, Hawaii, 2004
- Kuansan Wang, Ye-Yi Wang, and Alex Acero, Use and Acquisition of Semantic Language Model, in NAACL/HLT (Short Paper), Association for Computational Linguistics, Boston, MA, 2004
2003
- J. Wu, Jasha Droppo, Li Deng, and Alex Acero, A Noise-Robust ASR Front-End Using Wiener Filter Constructed from MMSE Estimation of Clean Speech and Noise, in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., U.S. Virgin Islands, December 2003
- Y. Zheng, Z. Liu, Z. Zhang, M. Sinclair, Jasha Droppo, Li Deng, Xuedong Huang, and Alex Acero, Air and Bone-Conductive Integrated Microphones for Robust Speech Detection and Enhancement, in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., U.S. Virgin Islands, December 2003
- Yongang Deng, Milind Mahajan, and Alex Acero, Estimating Speech Recognition Error Rate without Acoustic Test Data,, in Proc. of the European Conference on Speech Communication, International Speech Communication Association, September 2003
- Ciprian Chelba and Alex Acero, Discriminative Training of N-gram Classifiers for Speech and Text Routing, in Proc. of the Eurospeech Conference, International Speech Communication Association, September 2003
- Li Deng, I. Bazzi, and Alex Acero, Tracking Vocal Tract Resonances Using an Analytical Nonlinear Predictor and a Target-guided Temporal Constraint, in Proc. of the Eurospeech Conference. Geneva, September 2003
- Asela Gunawardana and Alex Acero, Adapting acoustic models to new domains and conditions using untranscribed data, in International Conference on Speech Communication and Technology, International Speech Communication Association, September 2003
- Mike Seltzer, Jasha Droppo, and Alex Acero, A Harmonic-Model-Based Front End for Robust Speech Recognition, in Proc. Eurospeech Conference, International Speech Communication Association, Geneva, Switzerland, September 2003
- Dong Yu, Kuansan Wang, Milind Mahajan, Peter Mau, and Alex Acero, Improved Name Recognition With User Modeling, in Proc. of the Eurospeech Conference, International Speech Communication Association, September 2003
- Jasha Droppo, Li Deng, and Alex Acero, A Comparison of Three Non-Linear Observation Models for Noisy Speech Features, in Proc. Eurospeech Conference, International Speech Communication Association, Geneva, Switzerland, September 2003
- Li Deng, Jasha Droppo, and Alex Acero, Incremental Bayes Learning with Prior Evolution for Tracking Non-Stationary Noise Statistics from Noisy Speech Data, in Proc. ICASSP, Institute of Electrical and Electronics Engineers, Inc., Hong Kong, April 2003
- C. Chelba, Milind Mahajan, and Alex Acero, Speech Utterance Classification, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, April 2003
- Issam Bazzi, Alex Acero, and Li Deng, An Expectation-Maximization Approach for Formant Tracking using a Parameter-free Nonlinear Predictor, in Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 2003
- Ye-Yi Wang and Alex Acero, Concept Acquisition in Example-Based Grammar Authoring, in IEEE International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Hong Kong, China, 2003
- Ye-Yi Wang and Alex Acero, Is Word Error Rate a Good Indicator for Spoken Language Understanding Accuracy, in IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., St. Thomas, US Virgin Islands, 2003
- Ye-Yi Wang and Alex Acero, Combination of CFG and N-gram Modeling in Semantic Grammar Learning, in Eurospeech 2003, International Speech Communication Association, Geneva, Switzerland, 2003
2002
- Li Deng, Alex Acero, Ye-Yi Wang, Kuansan Wang, Hsiao-Wuen Hon, Jasha Droppo, Milind Mahajan, and XD Huang, A speech-centric perspective for human-computer interface, in Proc. of the IEEE Fifth Workshop on Multimedia Signal Processing, Institute of Electrical and Electronics Engineers, Inc., December 2002
- Jasha Droppo, Alex Acero, and Li Deng, A Nonlinear Observation Model for Removing Noise from Corrupted Speech Log Mel-Spectral Energies, in Proc. International Conference on Spoken Language Processing, Denver, Colorado, September 2002
- Jasha Droppo, Li Deng, and Alex Acero, Evaluation of SPLICE on the Aurora 2 and 3 Tasks, in Proc. International Conference on Spoken Language Processing, International Speech Communication Association, Denver, Colorado, September 2002
- Li Deng, Jasha Droppo, and Alex Acero, Log-Domain Speech Feature Enhancement Using Sequential MAP Noise Estimation and a Phase-sensitive Model of the Acoustic Environment, in Proc. International Conference on Spoken Language Processing, Denver, Colorado, September 2002
- Li Deng, Jasha Droppo, and Alex Acero, Exploiting Variances in Robust Feature Extraction Based on a Parametric Model of Speech Distortion, in Proc. International Conference on Spoken Language Processing, Denver, Colorado, September 2002
- Jasha Droppo, Li Deng, and Alex Acero, Uncertainty Decoding with SPLICE for Noise Robust Speech Recognition, in Proc. ICASSP, Institute of Electrical and Electronics Engineers, Inc., Florida, May 2002
- Y. Xiang, Y. Hua, S. An, and Alex Acero, Separating Colored Signals Distorted by Convolutive Channels Using Diagonal Constrained Decorrelation, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, May 2002
- Li Deng, Jasha Droppo, and Alex Acero, A Bayesian Approach to Speech Feature Enhancement using the Dynamic Cepstral Prior, in Proc. ICASSP, Institute of Electrical and Electronics Engineers, Inc., Florida, May 2002
- Ye-Yi Wang, Alex Acero, Ciprian Chelba, Brendan Frey, and Leon Wong, Combination of Statistical and Rule-Based Approaches for Spoken Language Understanding., in International Conference on Spoken Processing, International Speech Communication Association, Denver, Colorado, 2002
- Ye-Yi Wang and Alex Acero, Evaluation of Spoken Language Grammar Learning in ATIS Domain, in IEEE International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Orlando, Florida, 2002
2001
- Li Deng, Jasha Droppo, and Alex Acero, Recursive Noise Estimation Using Iterative Stochastic Approximation for Stereo-based Robust Speech Recognition, in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., Madonna di Campliglio, Italy, December 2001
- Jasha Droppo, Alex Acero, and Li Deng, Evaluation of the SPLICE Algorithm on the Aurora 2 Database, in Proc. Eurospeech Conference, International Speech Communication Association, Aalbodk, Denmark, September 2001
- B. Frey, Li Deng, T. Kristjansson, and Alex Acero, ALGONQUIN: Iterating Laplace's Method to Remove Multiple Types of Acoustic Distortion for Robust Speech Recognition, in Proc. of the Eurospeech Conference, September 2001
- H. Attias, Li Deng, Alex Acero, and John Platt, A New Method for Speech Denoising and Robust Speech Recognition Using Probabilistic Models for Clean Speech and for Noise, in Proc. of the Eurospeech Conference, September 2001
- Jasha Droppo, Alex Acero, and Li Deng, Efficient Online Acoustic Environment Estimation for FCDCN in a Continuous Speech Recognition System, in Proc. ICASSP, Institute of Electrical and Electronics Engineers, Inc., Salt Lake City, Utah, May 2001
- Li Deng, Alex Acero, L. Jiang, Jasha Droppo, and Xuedong Huang, High-Performance Robust Speech Recognition Using Stereo Training Data, in Proc. ICASSP, Institute of Electrical and Electronics Engineers, Inc., Salt Lake City, Utah, May 2001
- Y. Xiang, Y. Hua, S. An, and Alex Acero, Experimental Investigation of Delayed Instantaneous Demixer for Speech Enhancement, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, May 2001
- T. Kristjansson, B. Frey, Li Deng, and Alex Acero, Towards Non-Stationary Model-Based Noise Adaptation for Large Vocabulary Speech Recognition, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, May 2001
- Xuedong Huang, Alex Acero, C. Chelba, Li Deng, Jasha Droppo, D. Duchene, J. Goodman, Hsiao-Wuen Hon, D. Jacoby, L. Jiang, R. Loynd, Milind Mahajan, P. Mau, S. Meredith, S. Mughal, S. Neto, M. Plumpe, K. Stery, G. Venolia, Kuansan Wang, and Ye-Yi Wang, MIPAD: A Multimodal Interactive Prototype, in International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Salt Lake City, Utah, USA, 2001
- Ye-Yi Wang and Alex Acero, Grammar Learning for Spoken Language Understanding, in IEEE Workshop on Automatic Speech Recognition and Understanding, Institute of Electrical and Electronics Engineers, Inc., Madonna di Campiglio, Italy, 2001
- B. Frey, T. Kristjansson, Li Deng, and Alex Acero, Learning dynamic noise models from noisy speech for robust speech recognition, in Advances in Neural Information Processing Systems (NIPS), Vol. 14, Vancouver, Canada, 2001, pp. 101-108, 2001
2000
- H. Attias, J. Platt, Alex Acero, and Li Deng, Speech Denoising and Dereverberation Using Probabilistic Models, in NIPS, November 2000
- Alex Acero, S. Altschuler, and L. Wu, Speech/Noise Separation Using Two Microphones and a VQ Model of Speech Signals, in Proc. Int. Conf. on Spoken Language Processing, October 2000
- Alex Acero, Li Deng, T. Kristjansson, and J. Zhang, HMM Adaptation Using Vector Taylor Series for Noisy Speech Recognition, in Proc. Int. Conf. on Spoken Language Processing, October 2000
- Li Deng, Alex Acero, M. Plumpe, and Xuedong Huang, Large-Vocabulary Speech Recognition under Adverse Acoustic Environments,, in Proc. Int. Conf. on Spoken Language Processing, October 2000
- Xuedong Huang, Alex Acero, Ciprian Chelba, Li Deng, Doug Duchene, Joshua Goodman, Hsiao-Wuen Hon, Derek Jacoby, Li Jiang, Ricky Loynd, Milind Mahajan, Peter Mau, Scott Meredith, Salman Mughal, Salvado Neto, Mike Plumpe, Kuansan Wang, and Ye-Yi Wang, MiPad: A Next Generation PDA Prototype, in International Conference on Spoken Language Processing, International Speech Communication Association, Beijing, China, 2000
1999
- Matthew Richardson, Mei-Yuh Hwang, Alex Acero, and Xuedong Huang, Improvements on Speech Recognition for Fast Talkers, in Proc. of the Eurospeech Conference, September 1999
- Alex Acero, Formant Analysis and Synthesis using Hidden Markov Models, in Proc. of the Eurospeech Conference, September 1999
1998
- Jasha Droppo and Alex Acero, Maximum a Posteriori Pitch Tracking, in Proc. International Conference on Spoken Language Processing, International Speech Communication Association, Sydney, Australia, December 1998
- Alex Acero, A Mixed-Excitation Frequency Domain Model for Time-Scale Pitch-Scale Modification of Speech, in Proc. of the Int. Conf. on Spoken Language Processing, December 1998
- M. Plumpe, Alex Acero, Hsiao-Wuen Hon, and Xuedong Huang, HMM-Based Smoothing for Concatenative Speech Synthesis, in Proc. of the Int. Conf. on Spoken Language Processing, December 1998
- Hsiao-Wuen Hon, Alex Acero, Xuedong Huang, J. Liu, and M. Plumpe, Automatic Generation of Synthesis Units for Trainable Text-to-Speech Systems, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, December 1998
- Alex Acero, Source-Filter Models for Time-Scale Pitch-Scale Modification of Speech, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, May 1998
1997
- X. D. Huang, Alex Acero, Hsiao-Wuen Hon, Yun-Cheng Ju, J. Liu, S. Meredith, and M. Plumpe, Recent Improvements on Microsofts Trainable Text-to-Speech System: Whistler, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 1997
1996
- Xuedong Huang, Alex Acero, J. Adcock, J. Goldsmith, and J. Liu, Whistler: A Trainable Text-to-Speech System, in Proc. of the Int. Conf. on Spoken Language Processing, International Speech Communication Association, October 1996
- Alex Acero and Xuedong Huang, Speaker and Gender Normalization for Continuous-Density Hidden Markov Models, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal , IEEE, May 1996
1995
- Alex Acero and Xuedong Huang, Augmented Cepstral Normalization for Robust Speech Recognition, in Proc. of the IEEE Workshop on Automatic Speech Recognition, December 1995
- Xuedong Huang, Alex Acero, Fil Alleva, Mei-Yuh Hwang, Li Jiang, and Milind Mahajan, Microsoft Windows Highly Intelligent Speech Recognizer: Whisper, in Proc. of the International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., May 1995
1994
- Cecilia de la Torre and Alex Acero, Discriminative Training of Garbage Models for Out-of-Vocabulary Utterance Rejection, in Proc. of the International Conference on Spoken Language Systems, International Speech Communication Association, September 1994
- Daniel Tapias, Alex Acero, Javier Esteve, and Juan Carlos Torrecilla, The Vestel Telephone Speech Database, in Proc. of the International Conference on Spoken Language Systems, September 1994
- Richard Stern, Fu-Hua Liu, Pedro Moreno, and Alex Acero, Signal Processing for Robust Speech Recognition, in Proc. of the International Conference on Spoken Language Systems, September 1994
- Fu-Hua Liu, Richard Stern, Alex Acero, and Pedro Moreno, Environment Normalization for Robust Speech Recognition using Direct Cepstral Comparisons, in Proc. of the International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 1994
- Fu-Hua Liu, Pedro Moreno, Richard Stern, and Alex Acero, Signal Processing for Robust Speech Recognition, in Proc. of ARPA Human Language Technology Workshop, March 1994
1993
- Alex Acero, Carlos Crespo, Celinda de la Torre, and Juan Carlos Torrecilla, A Robust HMM-Based Endpoint Detector for Telecommunication Applications, in Proc. of Eurospeech, International Speech Communication Association, September 1993
- Luis Villarrubia and Alex Acero, Rejection Techniques for Digit Recognition in Telecommunication Applications, in Proc. of the International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 1993
- Fu-Hua Liu, Richard Stern, Xuadong Huang, and Alex Acero, Efficient Cepstral Normalization for Robust Speech Recognition, in Proc. of DARPA Speech and Natural Language Workshop, March 1993
1992
- Alex Acero and Richard Stern, Cepstral Normalization for Robust Speech Recognition, in Proc. of ESCA Workshop on Speech Processing in Adverse Conditions, International Speech Communication Association, November 1992
- Richard Stern, Fu-Hua Liu, Yoshiaki Ohshima, Tom Sullivan, and Alex Acero, Multiple Approaches to Robust Speech Recognition, in Proc. of the ICSLP, International Speech Communication Association, October 1992
- Alex Acero, Luis Villarrubia, and Carlos Santamaria, Rejection and Echo Canceling in Telecommunication Applications, in Proc. of the IEEE Workshop on Interactive Voice Technology for Telecommunication Applications, Institute of Electrical and Electronics Engineers, Inc., October 1992
- Fu-Hua Liu, Richard Stern, and Alex Acero, Efficient Joint Compensation of Speech for the Effects of Additive Noise and Linear System, in Proc. of the Sixth ARPA Workshop on Human Language Technology, Morgan Kaufmann Publishers, March 1992
- Richard Stern, Fu-Hua Liu, Yoshiaki Ohshima, Tom Sullivan, and Alex Acero, Multiple Approaches to Robust Speech Recognition, in Proc. of DARPA Speech and Natural Language Workshop, February 1992
1991
- Alex Acero and Richard Stern, Robust Speech Recognition by Normalization of the Acoustic Space, in Proc. of the International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., May 1991
1990
- Alex Acero and Richard Stern, Acoustical Pre-Processing for Robust Spoken Language Systems, in Proc. of the International Conference on Spoken Language Systems, International Speech Communication Association, November 1990
- Alex Acero and Richard Stern, Towards Microphone-Independent Spoken Language Systems, in Proc. of the DARPA Speech and Natural Language Workshop, June 1990
- Alex Acero and Richard Stern, Environmental Robustness in Automatic Speech Recognition, in Proc. of International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 1990
1989
- Richard Stern and Alex Acero, Acoustical preprocessing for Automatic Speech Recognition, in Proc. of DARPA Speech and Natural Language Workshop, October 1989
- Daniel Povey, Geoffrey Zweig, and Alex Acero, Speaker Adaptation with an Exponential Transform, no. MSR-TR-2011-101, 9 September 2011
- Amarnag Subramanya, Zhengyou Zhang, Zicheng Liu, and Alex Acero, Speech Modeling with Magnitude-Normalized Complex Spectra and its Application to Multisensory Speech Enhancement, no. MSR-TR-2005-126, September 2005
- Ya Chang, Ross Cutler, Zicheng Liu, Zhengyou Zhang, Alex Acero, and Matthew Turk, Automatic Head-Size Equalization in Panorama Images for Video Conferencing, no. MSR-TR-2005-48, May 2005
- Ciprian Chelba and Alex Acero, Conditional Maximum Likelihood Estimation of Naive Bayes Probability Models Using Rational Function Growth Transform, no. MSR-TR-2004-33, April 2004
- Alex Acero, Acoustical and Environmental Robustness in Automatic Speech Recognition, 13 September 1990
Last updated: June 2, 2011
E-mail: alexac at microsoft dot com
U.S.Mail: Microsoft Corporation, One Microsoft Way, Redmond WA, 98052-6399, USA
Tel: (425) 706-1597
Fax: (425) 706-7329 (This is the main MS FAX number so make sure to send documents to my attention)
