    Xiaodong He

       

    Xiaodong He is a Researcher in the Natural Language Processing Group.


    Background and Interests

    Xiaodong He received the BS degree from Tsinghua University (Beijing) in 1996, the MS degree from the Chinese Academy of Sciences in 1999, and the PhD degree from the University of Missouri - Columbia in 2003. His current research interests include statistical pattern recognition and machine learning, machine translation, speech recognition, and natural language processing. He is a senior member of IEEE and a member of ACL and Sigma Xi.

    In 2006, he joined the Natural Language Processing (NLP) group of Microsoft Research (MSR), Redmond, WA, where he is currently working on various machine translation (MT) topics including system combination, word alignment, and model optimization. In the 2008 NIST Open MT Evaluation, he and colleagues submitted the MSR-NRC-SRI entry, which gave the best Chinese-to-English result in the Common-Data Track.

    Prior to joining NLP/MSR, he was with Microsoft Speech and Natural Language group from 2003 to 2006, where he engaged in a wide range of speech recognition research and development activities including large vocabulary discriminative training, confidence measure, and model adaptation. He has authored/coauthored more than 20 papers and one book in these areas. His work has been incorporated in a variety of Microsoft speech products.


    Special Issue in IEEE Journal of Selected Topics in Signal Processing

    We are organizing a Special Issue on Statistical Learning Methods for Speech and Language Processing, which will be published in the IEEE Journal of Selected Topics in Signal Processing in 2010. Original and unpublished papers in the relevant areas are solicited.


    NIPS 2008 Workshop

    We organized the NIPS 2008 workshop Speech and Language: Learning-based Methods and Systems. It covered a variety of advanced topics in the speech and language processing areas. More details can be found at NIPS08 WSL(a)


    Publications

    Book

    • Xiaodong He and Li Deng, 2008. Discriminative Learning for Speech Recognition: Theory and Practice, Morgan & Claypool Publishers, 2008. ISBN: 1598293087 (order from Amazon.com)

    Journals

    • Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen, Robert Moore. 2009. Improved Monolingual Hypothesis Alignment for Machine Translation System Combination. ACM Transactions on Asian Language Information Processing -- special issue on Machine Translation of Asian Languages, June 2009 (pdf)
    • Xiaodong He, Li Deng, Wu Chou, 2008. Discriminative Learning in Sequential Pattern Recognition - A Unifying Review for Optimization-Oriented Speech Recognition. Feature Article, IEEE Signal Processing Magazine, September 2008 (pdf)
    • Dong Yu, Li Deng, Xiaodong He, and Alex Acero, 2008. Large-Margin Minimum Classification Error Training: A Theoretical Risk Minimization Perspective. Computer Speech and Language, Vol. 22, October 2008 (pdf)
    • Xiaodong He and Li Deng, 2007. A New Look at Discriminative Training for Hidden Markov Models. Invited paper, Pattern Recognition Letters, Vol. 28, August 2007 (pdf)
    • Xiaodong He and Yunxin Zhao, 2007. Prior Knowledge Guided Maximum Expected Likelihood based Model Selection and Adaptation for Nonnative Speech Recognition. Computer Speech and Language, Vol. 21, April 2007 (pdf)
    • Xiaodong He and Yunxin Zhao, 2003. Fast Model Selection Based Speaker Adaptation for Nonnative Speech. IEEE Transactions on Speech and Audio Processing, Vol. 11, July 2003 (pdf)

    Conferences

    • Xiaodong He and Kristina Toutanova, 2009. Joint Optimization for Machine Translation System Combination. EMNLP August 2009 (pdf)
    • Chi-Ho Li, Xiaodong He, Yupeng Liu and Ning Xi, 2009. Incremental HMM Alignment for MT System Combination. ACL. August 2009 (pdf)
    • Yong Zhao and Xiaodong He, 2009. Using N-gram based Features for Machine Translation System Combination. NAACL-HLT May 2009 (pdf)
    • Xiaodong He, Mei Yang, Jianfeng Gao, Patrick Nguyen, Robert Moore, 2008. Indirect-HMM-based Hypothesis Alignment for Combining Outputs from Machine Translation Systems. EMNLP. October 2008 (pdf)
    • X. He, J. Gao, C. Quirk, P. Nguyen, A. Menezes, R. Moore, K. Toutanova, M. Yang, W. Dolan, M. Li, C.-H. Li, D. Zhang, L. Jiang, C. Niu, M. Zhou, G. Foster, R. Kuhn, J. Zheng, W. Wang, N. F. Ayan, D. Vergyri, N. Scheffer, A. Stolcke, 2008. The MSR-NRC-SRI MT System for NIST Open Machine Translation 2008 Evaluation. the 2008 NIST Open Machine Translation Evaluation Workshop. March 2008 (pdf)
    • X. He, J. Gao, C. Quirk, P. Nguyen, A. Menezes, R. Moore, K. Toutanova, M. Yang, W. Dolan, M. Li, C.-H. Li, D. Zhang, L. Jiang, C. Niu, M. Zhou, 2008. The MSR-MSRA MT System for NIST Open Machine Translation 2008 Evaluation. the 2008 NIST Open Machine Translation Evaluation Workshop. March 2008 (pdf)
    • Xiaodong He, 2007. Using Word-Dependent Transition Models in HMM based Word Alignment for Statistical Machine Translation. ACL07 2nd SMT workshop. (pdf)
    • Patrick Nguyen, Milind Mahajan and Xiaodong He, 2007. Training Non-Parametric Features for Statistical Machine Translation. ACL 07 2nd SMT workshop. (pdf)
    • Masaki Itagaki, Takako Aikawa, Xiaodong He, 2007. Automatic Validation of Terminology Translation Consistency with Statistical Method. MT Summit XI. (pdf)
    • Dong Yu, Li Deng, Xiaodong He, Alex Acero, 2007. Large-Margin Minimum Classification Error Training For Large-Scale Speech Recognition Tasks. ICASSP. (pdf)
    • Qiang Fu, Xiaodong He, Li Deng, 2007. Phone-Discriminating Minimum Classification Error (P-MCE) Training for Phonetic Recognition. Interspeech. (pdf)
    • X. He, A. Menezes, C. Quirk, A. Aue, S. Corston-Oliver, JF. Gao, and P. Nguyen, 2006. Microsoft Research Treelet Translation System: NIST MT Evaluation 06. the 2006 NIST Open Machine Translation Evaluation Workshop. (pdf)
    • Xiaodong He, Li Deng, and Wu Chou, 2006. A Novel Learning Method for Hidden Markov Models in Speech and Audio Processing. IEEE MMSP. (pdf)
    • Xin Lei, Jon Hamaker, and Xiaodong He, 2006. Robust Feature Space Adaptation For Telephony Speech Recognition. InterSpeech (pdf)
    • Dong Yu, Li Deng, Xiaodong He, and Alex Acero, 2006. Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition. InterSpeech (pdf)
    • Xiaodong He and Yunxin Zhao, 2004. Prior knowledge guided MEL based model selection and adaptation for nonnative speech recognition. ICASSP (pdf)
    • Xiaodong He and Wu Chou, 2003. Minimum Classification Error Linear Regression for Acoustic Model Adaptation of Continuous Density HMMs. ICASSP (pdf)
    • Xiaodong He and Wu Chou, 2003. Minimum Classification Error (MCE) Model Adaptation of Continuous Density HMMs. In Proceedings of European Conf. on Speech Communication and Technology. (pdf)
    • Wu Chou and Xiaodong He, 2003. Maximum a Posteriori Linear Regression (MAPLR) Variance Adaptation for Continuous Density HMMs. In Proceedings of European Conf. on Speech Communication and Technology. (pdf)
    • Xiaodong He and Yunxin Zhao, 2002. Maximum Expected Likelihood Based Model Selection and Adaptation for Nonnative English Speakers. In Proceedings of Int'l Conf. on Spoken Language Processing. (pdf)
    • Xiaodong He and Yunxin Zhao, 2002. Fast model adaptation and complexity selection for nonnative English speakers. ICASSP (pdf)
    • Xiaodong He and Yunxin Zhao, 2001. Model Complexity Optimization for Nonnative English Speakers. In Proceedings of European Conf. on Speech Communication and Technology. (pdf)
    • Yunxin Zhao, Xiao Zhang, Xiaodong He, Laura Schopp, 2000. A Combined Adaptive and Decision Tree Based Speech Separation Technique for Telemedicine Applications. In Proceedings of Int'l Conf. on Spoken Language Processing. (pdf)
    • Xiaodong He, Jian Liu, Jianlai Zhou, Tiecheng Yu, 1999. Research on Speech Units Modeling in Continuous Speech Recognition. In Proceedings of European Conf. on Speech Communication and Technology. (pdf)
    • Jianlai Zhou, Xiaodong He, Tiecheng Yu, Fuyuan Mo, 1999. A New Hybrid Structure of Speech Recognizer Based on HMM and Neural Network. In Proceedings of European Conf. on Speech Communication and Technology. (pdf)
    • Jian Liu, Xiaodong He, Fuyuan Mo, Tiecheng Yu, 1999. Study on Tone Classification of Chinese Continuous Speech in Speech Recognition System. In Proceedings of European Conf. on Speech Communication and Technology. (pdf)
    • Xiaodong He, Jian Liu, Tiecheng Yu, 1999. Research on Segmentation and Labeling of Speech Corpora. In Proceedings of Oriental COCOSDA, International Workshop on East-Asian Language Resource and Evaluation. (pdf)
    • Xiaodong He, Jianlai Zhou, Jian Liu, Tiecheng Yu, 1999. A Speaker Independent Vocabulary Extensible Chinese Speech Recognition System. In Proceedings of International Symposium on Machine Translation and Computer Language Information Processing. (pdf)

    Thesis

    • Xiaodong He, 2003. Model Selection based Speaker Adaptation and its Application to Nonnative Speech Recognition. PhD dissertation, University of Missouri-Columbia, 2003

    Others

    • Xiaodong He and Li Deng, 2007. Discriminative Learning in Speech Recognition. Technical Report of Microsoft Research (MSR-TR-2007-129). Oct 2007.
    • Xiaodong He, 2004. Recognition Confidence and Threshold Tuning. Feature article, Microsoft Speech Server Newsletter July 2004

     

    Patents

    • Confidence Threshold Tuning, with Li Jiang, Julian Odell, and Wei Zhang. (pending)
    • Speech Models Generated Using Competitive Training, Asymmetric Training, And Data Boosting. with Jian Wu. (pending)
    • Speech Recognition Using Adaptation and Prior Knowledge. with Xin Lei, Jonathan Hamaker, and Patrick Nguyen. (pending)
    • Identifying Language Origin of Words, with Min Chu, Yining Chen, Shiun-Zu Kuo, Kevin Feige, Yifan Gong, and Megan Riley. (pending)
    • Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition. with Dong Yu, Li Deng and Alex Acero. (pending)
    • A new method of discriminative training for hidden Markov models. with Li Deng. (pending)
    • Using Word Dependent Transition Models in HMM based Word Alignment for Statistical Machine Translation. (pending)
    • Automatic Validation of Terminology Translation Consistency and Its Applications. with Masaki Itagaki, Takako Aikawa. (pending)
    • A Generic Framework For Large Margin MCE in Speech Recognition. with Li Deng. (pending)
    • Segment Discriminating Minimum Classification Error Pattern Recognition. with Dong Yu, Li Deng, and Alex Acero. (pending)
    • HMM Alignment for Combining Translation Systems. with Mei Yang, Jianfeng Gao, Patrick Nguyen, Robert Moore. (pending)
    • Using Machine Translation Technologies to help Foreign Language Education. with Alex Acero and Sebastian de la Chica. (pending)

     

    Technical Services

    • Guest Editor, Special Issue in IEEE Journal of Selected Topics in Signal Processing, on Statistical Learning Methods for Speech and Language Processing
    • Co-Chair, NIPS 2008 Workshop on Speech and Language: Learning-Based Methods and Systems, Whistler, BC, Canada, 2008
    • Program Committee Member: IEEE International Conference on Semantic Computing (ICSC) 2007
    • Technical Committee Member: International Symposium on Chinese Spoken Language Processing (ISCSLP) 2008
    • Program Committee Member: The Int’l Conf. on Internet & Multimedia Systems & Applications, 2005, 2006, 2007
    • Reviewer: IEEE Transactions on Speech and Audio Processing, IEEE Signal Processing Magazine, IEEE Signal Processing Letters, IEEE Transactions on Computer, Speech Communication, Pattern Recognition, Pattern Recognition Letters, ICASSP, Interspeech, ASRU, HLT, NAACL, ACL, EMNLP, MT Summit

     

    Honors and Awards

    • Gold Star Award from Microsoft, 2005
    • Patent awards from Microsoft in 2005-2008
    • Member of Sigma Xi since 2002
    • Award of student author grant, Int'l Conf. on Spoken Language Processing, 2002
    • Outstanding Academic Achievement Award, University of Missouri, 2001
    • Second Prize, the 14th "Challenge Cup" Sci & Tech Innovation Competition, Tsinghua University, 1996
    • Second Prize, the 13th "Challenge Cup" Sci & Tech Innovation Competition, Tsinghua University, 1995
    • Tsinghua – AIWA outstanding Sci. & Tech. Activities Award, 1995

    E-mail: xiaohe@microsoft.com
    U.S. Mail: Microsoft Corporation, One Microsoft Way, Redmond, WA 98052-6399, USA
    Tel: (425) 706-4939
    Fax: (425) 706-7329 (This is the main Microsoft fax number, so mark documents for the attention of Xiaodong He.)


    Natural Language Processing group's home page.