Background and Interests
Xiaodong He is a Researcher in the Conversational Systems Research Center of Microsoft Research Redmond. He is also an Affiliate Professor in the Department of Electrical Engineering at the University of Washington (Seattle). He received the BS degree from Tsinghua University (Beijing) in 1996, MS degree from Chinese Academy of Sciences (Beijing) in 1999, and the PhD degree from the University of Missouri - Columbia in 2003.
His research interests include machine learning, speech recognition, spoken language understanding, machine translation, natural language processing, and information retrieval. He has published extensively in these areas. In benchmark evaluations, he and his colleagues developed the MSR-NRC-SRI entry and the MSR entry which obtained No. 1 place in the 2008 NIST MT Evaluation and No. 1 place in the 2011 IWSLT Evaluation, all in Chinese-English translation, respectively.
He has held editorial positions on several IEEE jounrals, and has served as area chair and program committe member of major speech and language processing conferences. He is a senior member of IEEE and a member of ACL.
News and Events
- Our paper on "Speech-Centric Information Processing" in the Proceedings of the IEEE is online now. Also check out other recently written papers in SIGIR, NAACL, WWW, and ICASSP at Publications.
- Rick Rashid demonstrated our speech-to-speech translation system during his keynote in Tianjin at MSRA’s 21 Century Computing Conference on October 25.
- Translator App supports Windows Phone 8 now, and read the technology behind at Inside Microsoft Research.
- Gave a lecture “Introduction to speech and human language technology” at Columbia University on October 12. About 200 students in engineering and applied science are in the audience.
ICASSP 2013 Tutorial on Speech Translation: Theory and Practice
We have gaven a tutorial on Speech Translation: Theory and Practice at ICASSP 2013. The slides can be found here.
Special Issue in IEEE Transactions on Audio, Speech, and Language Processing
Submission deadline of the Special Issue on Large-Scale Optimization for Audio, Speech, and Language Processing was passed. Manuscripts are under review now.
Academic Services
- Chair of Special Sessions, IEEE ICASSP 2013
- Associate Editor, IEEE Signal Processing Magazine
- Guest Editor, Special Issue on Large-Scale Optimization for Audio, Speech, and Language Processing, in IEEE Transactions on Audio, Speech, and Language Processing
- Lead Guest Editor, Special Issue on Statistical Learning Methods for Speech and Language Processing, in IEEE Journal of Selected Topics in Signal Processing
- Co-Chair, NIPS 2008 Workshop on Speech and Language: Learning-Based Methods and Systems, Whistler, BC, Canada, 2008
- Grant Reviewer: Swiss National Science Foundation
- Program Committee Member: ACL, NAACL, EMNLP, COLING, AAAI
- Reviewer: IEEE Transactions on Speech and Audio Processing, Proceedings of the IEEE, IEEE Signal Processing Magazine, IEEE Signal Processing Letters, IEEE Transactions on Computer, Speech Communication, Pattern Recognition, Pattern Recognition Letters, ICASSP, Interspeech, NIPS
Honors and Awards
- No. 1 Place, Chinese to English MT track, 2011 IWSLT Evaluation
- No. 1 Place, Chinese to English common data track, 2008 NIST MT Evaluation
- ICASSP 2011 Best Student Paper Award, co-author, for the paper by Yaodong Zhang, Li Deng, Xiaodong He, Alex Acero
- IEEE senior member since 2008
- Microsoft Gold Star Award, 2005
- Microsoft Patent awards, 2005-2012
- Microsoft Technology Transfer Award, 2009
- Member of Sigma Xi since 2002
- Award of student author grant, Int'l Conf. on Spoken Language Processing, 2002
- Outstanding Academic Achievement Award, University of Missouri, 2001
- Prizes in the 13rd & 14th "Challenge Cup" Sci & Tech Innovation Competition, Tsinghua University, 1995,1996
Contact
E-mail: xiaohe@microsoft.com
U.S.Mail: Microsoft Corporation, One Microsoft Way, Redmond WA, 98052-6399, USA
Tel: (425) 706-4939
Fax: (425) 706-7329 (to Xiaodong He's attention)
ICASSP 2013: Special Sessions
The 38th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) will be held at Vancouver, Canada in May 2013. A total of eight special sessions will be offered.
Special Issue in IEEE Journal of Selected Topics in Signal Processing
The Special Issue on Statistical Learning Methods for Speech and Language Processing was published in the IEEE Journal of Selected Topics in Signal Processing in December, 2010.
NIPS 2008 Workshop
The NIPS 2008 workshop on Speech and Language: Learning-based Methods and Systems covers a variety of advanced topics in the Speech and Language Processing area. More details can be found at the workshop's homepage NIPS08 WSL(a)
Book
Xiaodong He and Li Deng, 2008. Discriminative Learning for Speech Recognition: Theory and Practice, Morgan & Claypool Publishers, 2008. ISBN: 1598293087 (order from Amazon.com)
2013
- Grégoire Mesnil, Xiaodong He, Li Deng, and Yoshua Bengio, Investigation of Recurrent-Neural-Network Architectures and Learning Methods for Spoken Language Understanding, in Interspeech 2013, August 2013
- Hongning Wang, Xiaodong He, Ming-Wei Chang, Yang Song, Ryen White, and Wei Chu, Personalized Ranking Model Adaptation for Web Search, in The 36th Annual ACM SIGIR Conference (SIGIR'2013), ACM, July 2013
- Jianfeng Gao and Xiaodong He, Training MRF-Based Phrase Translation Models using Gradient Ascent, in the North American Chapter of the Association for Computational Linguistics (NAACL), Association for Computational Linguistics, June 2013
- Xiaodong He and Li Deng, Speech-Centric Information Processing: An Optimization-Oriented Approach, in Proceedings of the IEEE, IEEE, 31 May 2013
- Hongning Wang, Yang Song, Ming-Wei Chang, Xiaodong He, Ryen White, and Wei Chu, Learning to Extract Cross-Session Search Tasks, in International World Wide Web Conference (WWW), ACM, 13 May 2013
- Ryen White, Wei Chu, Ahmed Hassan, Xiaodong He, Yang Song, and Hongning Wang, Enhancing Personalized Search by Mining and Modeling Task Behavior, in International World Wide Web Conference (WWW), ACM, 13 May 2013
- Li Deng, Xiaodong He, and Jianfeng Gao, Deep Stacking Networks for Information Retrieval, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013
- Jennifer Gillenwater, Xiaodong He, Jianfeng Gao, and Li Deng, End-To-End Learning of Parsing Models for Information Retrieval, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013
- Xiaodong He, Li Deng, Dilek Hakkani-Tur, and Gokhan Tur, Multi-Style Adaptive Training for Robust Cross-Lingual Spoken Language Understanding, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013
- Bowen Zhou and Xiaodong He, Speech Translation: Theory and Practices, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013
- Po-Sen Huang, Li Deng, Mark Hasegawa-Johnson, and Xiaodong He, Random Features for Kernel Deep Convex Network, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013
- Li Deng, Jinyu Li, Jui-Ting Huang, Kaisheng Yao, Dong Yu, Frank Seide, Michael Seltzer, Geoff Zweig, Xiaodong He, Jason Williams, Yifan Gong, and Alex Acero, Recent Advances in Deep Learning for Speech Research at Microsoft, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013
2012
- Li Deng, Gokhan Tur, Xiaodong He, and Dilek Hakkani-Tur, Use of Kernel Deep Convex Networks and End-To-End Learning for Spoken Language Understanding, IEEE Workshop on Spoken Language Technologies, December 2012
- Dimitri Kanevsky, Xiaodong He, Georg Heigold, Haizhou Li, Hermann Ney, and Stephen Wright, Special Issue on Large-Scale Optimization for Audio, Speech, and Language Processing, IEEE SPS, August 2012
- Jianfeng Gao, Shasha Xie, Xiaodong He, and Alnur Ali, Learning Lexicon Models from Search Logs for Query Expansion, in Proceedings of EMNLP, ACM, July 2012
- Xiaodong He and Li Deng, Maximum Expected BLEU Training of Phrase and Lexicon Translation Models , in Proceedings of ACL, Association for Computational Linguistics, July 2012
- Antti-Veikko Rosti, Xiaodong He, Damianos Karakos, Gregor Leusch, Yuan Cao, Markus Freitag, Spyros Matsoukas, Hermann Ney, Jason Smith, and Bing Zhang, Review of Hypothesis Alignment Algorithms for MT System Combination via Confusion Network Decoding , in Proceedings of NAACL-HLT workshop on SMT (WMT), Association for Computational Linguistics, June 2012
- Gokhan Tur, Li Deng, Dilek Hakkani-Tur, and Xiaodong He, Towards Deeper Understanding Deep Convex Networks for Semantic Utterance Classification, IEEE International Confrence on Acoustics, Speech, and Signal Processing (ICASSP), March 2012
- Xiaodong He and Li Deng, Optimization in Speech-Centric Information Processing: Criteria and techniques, IEEE International Confrence on Acoustics, Speech, and Signal Processing (ICASSP), March 2012
- Amittai Axelrod, Xiaodong He, Li Deng, Alex Acero, and Mei-Yuh Hwang, New Methods and Evaluation Experiments on Translating TED Talks in the IWSLT Benchmark, IEEE International Confrence on Acoustics, Speech, and Signal Processing (ICASSP), March 2012
2011
- Xiaodong He, Amittai Axelrod, Li Deng, Alex Acero, Mei-Yuh Hwang, Alisa Nguyen, Andrew Wang, and Xiahui Huang, THE MSR SYSTEM FOR IWSLT 2011 EVALUATION, Internaltional Workshop on Spoken Language Translation (IWSLT), December 2011
- Xiaodong He and Li Deng, Speech Recognition, Machine Translation, and Speech Translation – A Unified Discriminative Learning Paradigm, in IEEE Signal Processing Magazine, September 2011
- Xiaodong He and Li Deng, Robust Speech Translation by Domain Adaptation, in Interspeech, International Speech Communication Association, August 2011
- Xiaodong He and Li Deng, Discriminative Learning of Feature Functions of Generative Type in Speech Translation, in Workshop on Learning Architectures, Representations, and Optimization for Speech and Visual Information Processing, ICML, July 2011
- Amittai Axelrod, Xiaodong He, and Jianfeng Gao, Domain Adaptation via Pseudo In-Domain Data Selection, in EMNLP, ACM, July 2011
- Xiaodong He, Li Deng, and Alex Acero, Why Word Error Rate is not a Good Metric for Speech Recognizer Training for the Speech Translation Task?, in Proc. ICASSP, IEEE, May 2011
- Yaodong Zhang, Li Deng, Xiaodong He, and Alex Acero, A Novel Decision Function and the Associated Decision-Feedback Learning for Speech Translation, in ICASSP, IEEE, May 2011
2010
- Xiaodong He, Li Deng, Roland Kuhn, Helen Meng, and Samy Bengio, Introduction to the Issue on Statistical Learning Methods for Speech and Language Processing, in IEEE Journal of Selected Topics in Signal Processing, IEEE, December 2010
- Jianfeng Gao, Xiaodong He, and Jian-Yun Nie, Clickthrough-Based Translation Models for Web Search: from Word Models to Phrase Models, in CIKM, 2010
2009
- Xiaodong He and Kristina Toutanova, Joint Optimization for Machine Translation System Combination, in In Proceedings of EMNLP, Association for Computational Linguistics, August 2009
- Chi-Ho Li, Xiaodong He, Yupeng Liu, and Ning Xi, Incremental HMM Alignment for MT System Combination, in ACL, 2009
- Yong Zhao and Xiaodong He, Using N-gram based Features for Machine Translation System Combination, in NAACL-HLT, 2009
- Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen, and Robert Moore, Improved Monolingual Hypothesis Alignment for Machine Translation System Combination, in ACM Transactions on Asian Language Information Processing -- special issue on Machine Translation of Asian Languages, ACM, 2009
2008
- Dong Yu, Li Deng, Xiaodong He, and Alex Acero, Large-Margin Minimum Classification Error Training: A Theoretical Risk Minimization Perspective, in Computer Speech and Language, vol. 22, no. 4, pp. 415-429, Elsevier , October 2008
- Xiaodong He and Li Deng, DISCRIMINATIVE LEARNING FOR SPEECH RECOGNITION: Theory and Practice, Morgan & Claypool, October 2008
- Xiaodong He, Li Deng, and Wu Chou, Discriminative Learning in Sequential Pattern Recognition --- A Unifying Review for Optimization-Oriented Speech Recognition, in IEEE Signal Processing Magazine, vol. 25, no. 5, pp. 14-36, Institute of Electrical and Electronics Engineers, Inc., September 2008
- Xiaodong He, Jianfeng Gao, Chris Quirk, Patrick Nguyen, Arul Menezes, Robert Moore, Kristina Toutanova, Mei Yang, Bill dolan, Mu Li, Chi-Ho Li, Dongdong Zhang, Long Jiang, and Ming Zhou, The MSR-MSRA MT System for NIST Open Machine Translation 2008 Evaluation, in The 2008 NIST Open Machine Translation Evaluation Workshop, 2008
- Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen, and Robert Moore, Indirect-HMM-based Hypothesis Alignment for Combining Outputs from Machine Translation Systems, in EMNLP, 2008
- Xiaodong He, Jianfeng Gao, Chris Quirk, Patrick Nguyen, Arul Menezes, Robert Moore, Kristina Toutanova, Mei Yang, Bill dolan, Mu Li, Chi-Ho Li, Dongdong Zhang, Long Jiang, Ming Zhou, George Foster, Roland Kuhn, Jing Zheng, Wen Wang, Necip Fazil Ayan, Dimitra Vergyri, Nicolas Scheffer, and Andreas Stolcke, The MSR-NRC-SRI MT System for NIST Open Machine Translation 2008 Evaluation, in The 2008 NIST Open Machine Translation Evaluation Workshop, 2008
2007
- Xiaodong He and Li Deng, Discriminative Learning in Speech Recognition, no. MSR-TR-2007-129, October 2007
- Masaki Itagaki, Takako Aikawa, and Xiaodong He, Automatic Validation of Terminology Translation Consistency with Statistical Method, European Association for Machine Translation, September 2007
- Qiang Fu, Xiaodong He, and Li Deng, Phone-Discriminating Minimum Classification Error (P-MCE) Training for Phonetic Recognition, in Proc. Interspeech, August 2007
- Patrick Nguyen, Milind Mahajan, and Xiaodong He, Training Non-Parametric Features for Statistical Machine Translation , in Proceedings of ACL workshop on SMT (WMT), Association for Computational Linguistics, June 2007
- Xiaodong He, Using Word-Dependent Transition Models in HMM based Word Alignment for Statistical Machine Translation, in in ACL workshop on SMT (WMT), Association for Computational Linguistics, June 2007
- Dong Yu, Li Deng, Xiaodong, and Alex Acero, Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks, in Proceedings of the ICASSP, Honolulu, Hawaii, IEEE, April 2007
- Xiaodong He and Yunxin Zhao, Prior Knowledge Guided Maximum Expected Likelihood based Model Selection and Adaptation for Nonnative Speech Recognition, in Computer Speech and Language, Elsevier, 2007
- Xiaodong He and Li Deng, A new look at discriminative learning for hidden Markov models, in Pattern Recognition Letters, vol. 28, pp. 1285-1294, 2007
2006
- Xiaodong He, Li Deng, and Wu Chou, A novel learning method for hidden Markov models in speech and audio processing, in IEEE MMSP, IEEE SPS, October 2006
- Dong Yu, Li Deng, Xiaodong He, and Alex Acero, Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition, in Proc. of the Interspeech Conference, International Speech Communication Association, September 2006
- Xiaodong He, Arul Menezes, Chris Quirk, Anthony Aue, Simon Corston-Oliver, Jianfeng Gao, and Patrick Nguyen, Microsoft Research Treelet Translation System: NIST MT Evaluation 06, National Institute of Standards and Technology , March 2006
- Xin Lei, Jon Hamaker, and Xiaodong He, Robust Feature Space Adaptation For Telephony Speech Recognition, in InterSpeech , 2006
2004
- Xiaodong He and Yunxin Zhao, Prior knowledge guided MEL based model selection and adaptation for nonnative speech recognition, in ICASSP, 2004
2003
- Xiaodong He, Model Selection based Speaker Adaptation and its Application to Nonnative Speech Recognition, University of Missouri - Columbia, 2003
- Xiaodong He and Wu Chou, Minimum Classification Error Linear Regression for Acoustic Model Adaptation of Continuous Density HMMs, in ICASSP, 2003
- Wu Chou and Xiaodong He, Maximum a Posteriori Linear Regression (MAPLR) Variance Adaptation for Continuous Density HMMs, in European Conf. on Speech Communication and Technology, ISCA, 2003
- Xiaodong He and Wu Chou, Minimum Classification Error (MCE) Model Adaptation of Continuous Density HMMs, in European Conf. on Speech Communication and Technology, ISCA, 2003
- Xiaodong He and Yunxin Zhao, Fast Model Selection Based Speaker Adaptation for Nonnative Speech, in IEEE Transaction on Speech and Audio Processing, IEEE, 2003
2002
- Xiaodong He and Yunxin Zhao, Maximum Expected Likelihood Based Model Selection and Adaptation for Nonnative English Speakers, in ICSLP, 2002
- Xiaodong He and Yunxin Zhao, Fast model adaptation and complexity selection for nonnative English speakers, in ICASSP, IEEE, 2002
2001
- Xiaodong He and Yunxin Zhao, Model Complexity Optimization for Nonnative English Speakers, in European Conf. on Speech Communication and Technology, 2001
2000
- Yunxin Zhao, Xiao Zhang, Xiaodong He, and Laura Schopp, A Combined Adaptive and Decision Tree Based Speech Separation Technique for Telemedicine Applications, in ICSLP, 2000
1999
- Jianlai Zhou, Xiaodong He, Tiecheng Yu, and Fuyuan Mo, A New Hybrid Structure of Speech Recognizer Based on HMM and Neural Network, in European Conf. on Speech Communication and Technology, 1999
- Xiaodong He, Jian Liu, Jianlai Zhou, and Tiecheng Yu, Research on Speech Units Modeling in Continuous Speech Recognition, in European Conf. on Speech Communication and Technology, 1999
- Xiaodong He, Jianlai Zhou, Jian Liu, and Tiecheng Yu, A Speaker independent vocabulary extensible Chinese speech recognition system, in International Symposium on Machine Translation and Computer Language Information Processing , 1999
- Jian Liu, Xiaodong He, Fuyuan Mo, and Tiecheng Yu, Study on Tone Classification of Chinese Continuous Speech in Speech Recognition System, in European Conf. on Speech Communication and Technology, 1999
- Xiaodong He, Jian Liu, and Tiecheng Yu, Research on Segmentation and Labeling of Speech Corpora, in International Workshop on East-Asian Language Resource and Evaluation, 1999
Links
- Background and Interests
- Academic Services
- Honors and Awards
- Contact
- Publications
- Special Issue in IEEE Transactions on ASLP: Call for Papers
- ICASSP 2013 Call for Special Sessions Proposals
- IEEE J-STSP Special Issue on Speech and Language Processing (Dec. 2010)
- NIPS 2008 Workshop on Speech and Language
