*
Quick Links|Home|Worldwide
Microsoft*
Search for


Dong Yu

Dong Yu (
俞栋) - Researcher, Speech Research Group
 

About Me | Speech Research Links | Photo Albums | Contact Me

About Me

Brief Biography

Dr. Dong Yu joined Microsoft in 1998 and Microsoft Speech Research Group in 2002, where he is currently  a researcher. He holds a Ph.D. degree in computer science from University of Idaho, an MS degree in computer science from Indiana University / Bloomington, an MS degree in electrical engineering from Chinese Academy of Sciences, and a BS degree (with honor) in electrical engineering from Zhejiang University (China). His current research interests include speech processing, robust speech recognition, discriminative training, spoken dialog system, voice search technology, machine learning, and pattern recognition. He is a senior member of IEEE, a member of ACM, and a member of ISCA.

Equation Number in Office 2007

Office 2007 comes with a very nice equation editor and bibliography manager. However, it does not support equation and theorem number management. To work around this problem. I have developed a set of macros. You can download it here.

English Publications

All copyrights for these documents are retained by the copyright holder, and permission to copy the work should be obtained from the copyright holder in writing. This copyright notice must be kept together with the downloaded or printed document.

Book Chapters

  1. Dong Yu, Li Deng, "Speech-Centric Multimodal User Interface Design in Mobile Technology", Handbook of Research on User Interface Design and Evaluation for Mobile Technology (Editor: Joanna Lumsden), Jan. 2008, IGI. (to appear)

Journals and Magazines

  1. Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex Acero, "A Discriminative Technique for Spoken Utterance Classification", IEEE Transactions on Audio, Speech and Language Processing. (to appear)
  2. Dong Yu, Li Deng, Xiaodong He, Alex Acero, "Large-Margin Minimum Classification Error Training: A Theoretical Risk Minimization Perspective", Computer Speech and Language, Volume 22, Issue 4, October 2008, Pages 415-429.
  3. Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, Alex Acero, "Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor", IEEE Transactions on Audio, Speech and Language Processing. (to appear)
  4. Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Alex Acero, "Voice Search – An Introduction", IEEE Signal Processing Magazine (Special Issue on Spoken Language Technology), May 2008 (to appear)
  5. Dong Yu, Li Deng, Alex Acero, "A Lattice Search Technique for a Long-Contextual-Span Hidden Trajectory Model of Speech", Speech Communication, Elsevier. Volume: 48 Issue: 9, Sep 2006. pp. 1214-1226.
  6. Li Deng, Dong Yu, and Alex Acero. "Structured Speech Modeling", IEEE Trans. on Audio, Speech and Language Processing. Volume: 14 Issue: 5, Sep 2006. pp. 1492- 1504.
  7. Dong Yu, Deborah Frincke, "Improving the Quality of Alerts and Predicting Intruder¡¯s Next Goal with Hidden Colored Petri-Net", Computer Networks, Volume 51, Issue 3, 21 February 2007, Pages 632-654.
  8. Dong Yu, Li Deng, Alex Acero, "Speaker-Adaptive Learning of Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation", Computer Speech and Language, Vol. 27, 2007, pp. 72-87.
  9. Li Deng, Dong Yu, and Alex Acero, "A Bi-Directional Target-Filtering Model of Speech Coarticulation and Reduction: Two-Stage Implementation for Phonetic Recognition", IEEE Trans. Audio, Speech & Language Proc, vol. 14, No. 1, pp 256-265, Jan 2006.
  10. Dong Yu, Alex Acero, "Semiautomatic Improvements of System-Initiative Spoken Dialog Applications Using Interactive Clustering", © IEEE Trans. Speech & Audio Proc (Special Issue on Data Mining of Speech, Audio and Dialog), Sept. 2005, vol.13, no. 5pp 661-671.
  11. Li Deng, Dong Yu, "A Speech-Centric Perspective for Human-Computer Interface - A Case Study", Journal of VLSI Signal Processing Systems (Special Issue on Multimedia Signal Processing), Vol. 41, No. 3. pp. 255-269, November 2005.

Refereed Conferences

  1. Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan Gong, Alex Acero, "Adaptation of Compressed HMM Parameters for Resource-Constrained Speech Recognition", ICASSP 2008, Las Vegas, USA.
  2. Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero, "HMM Adaptation Using a Phase-Sensitive Acoustic Distortion Model For Environment-Robust Speech Recognition", ICASSP 2008, Las Vegas, USA.
  3. Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, Alex Acero, "a Minimum-Mean-Square-Error Noise Reduction Algorithm on Mel-Frequency Cepstra for Robust Speech Recognition", ICASSP 2008, Las Vegas, USA.
  4. Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero, "High-Performance HMM Adaptation With Joint Compensation of Additive and Convolutive Distortions Via Vector Taylor Series", ASRU 2007, Kyoto, Japan.
  5. Ivan Tashev, Michael Seltzer, Yun-Cheng Ju, Dong Yu and Alex Acero, "Commute UX: Telephone Dialog System for Location-based Services", SIGDIAL 2007, Antwerp, Belgium.
  6. Dong Yu, Li Deng, "Large-Margin Discriminative Training of Hidden Markov Models for Speech Recognition", ICSC 2007, Irvine, CA (invited).
  7. Geoffrey Zweig, Patrick Nguyen, Yun-Cheng Ju, Ye-Yi Wang, Dong Yu, Alex Acero, "The Voice-Rate Dialog System for Consumer Ratings", Interspeech 2007, Antwerp, Belgium.
  8. Dong Yu, Li Deng, Alex Acero, "Handling Phonetic Context and Speaker Variation in a Structure-Based Speech Recognizer", Interspeech 2007, Antwerp, Belgium.
  9. J. Sherwani, Dong Yu, Tim, Paek, Mary Czerwinski, Yun-Cheng Ju, Alex Acero, "VoicePedia: Towards Speech-based Access to Unstructured Information", Interspeech 2007, Antwerp, Belgium.
  10. Dong Yu, Yun-Cheng Ju, Ye-Yi Wang, Geoffrey Zweig, Alex Acero, "Automated Directory Assistance System - from Theory to Practice", Interspeech 2007, Antwerp, Belgium.
  11. Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Geoffrey Zweig, Alex Acero, "Confidence Measures for Voice Search Applications", Interspeech 2007, Antwerp, Belgium.
  12. Dong Yu, Li Deng, Xiaodong He, Alex Acero, "Large-Margin Minimum Classification Error Training for Large-Scale Speech Recognition Tasks", ICASSP 2007, Hawaii, USA.
  13. Li Deng, Dong Yu, "Use of Differential Cepstra as Acoustic Features in Hidden Trajectory Modeling for Phonetic Recognition", ICASSP 2007, Hawaii, USA.
  14. Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex Acero, "A Discriminative Training Framework Using N-Best Speech Recognition Transcriptions and Scores for Spoken Utterance Classification", ICASSP 2007, Hawaii, USA.
  15. Dong Yu, Li Deng, Xiaodong He, Alex Acero, "Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition", in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
  16. Xiaolong Li, Li Deng, Dong Yu, Alex Acero, "A Time-Synchronous Phonetic Decoder for a Long-Contextual-Span Hidden Trajactory Model", Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
  17. Dong Yu, Yun Cheng Ju, Alex Acero, "Effective and Efficient Utterance Verification Technology Using Word N-gram Filler Models", in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
  18. Dong Yu, Yun Cheng Ju, Ye-Yi Wang, Alex Acero, "N-Gram Based Filler Model for Robust Grammar Authoring",in Proc. ICASSP 2006, Toulouse, France, May 2006.
  19. Li Deng, Dong Yu, and Alex Acero. "A Generative Modeling Framework for Structured Hidden Speech Dynamics", in Proc. NIPS Workshop on Advances in Structured Learning for Text and Speech Processing 2005.
  20. Li Deng, Dong Yu, and Alex Acero. "A Long-Contextual-Span Model of Resonance Dynamics for Speech Recognition: Parameter Learning and Recognizer Evaluation," in Proc. ASRU2005.
  21. Dong Yu, Li Deng, and Alex Acero. "Learning Statistically Characterized Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation and Reduction," in Proc of Interspeech, Lisbon, Sept 2005.
  22. Li Deng, Dong Yu, and Alex Acero."Evaluation of a Long-contextual-span Hidden Trajectory Model and Phonetic Recognizer Using A* Lattice Search," in Proc of Interspeech, Lisbon, Sept 2005.
  23. Dong Yu, Deborah Frincke, "Alert Confidence Fusion in Intrusion Detection Systems with Extended Dempster-Shafer Theory" © ACM, in the 43rd Annual ACM Southeast Conference, Kennesaw, Georgia, March 18, 19 & 20, 2005.
  24. Dong Yu, Milind Mahajan, Peter Mau, and Alex Acero, "Maximum Entropy Based Generic Filter for Language Model Adaptation", in ICASSP 05, March 19-23, 2005, Philadelphia, PA, USA
  25. Li Deng, Xiang Li, Dong Yu, and Alex Acero, "A Hidden Trajectory Model with Bi-directional Target-Filtering: Cascaded vs. Integrated Implementation for Phonetic Recognition", in ICASSP 05, March 19-23, 2005, Philadelphia, PA, USA
  26. Dong Yu, Mei-Yuh Hwang , Peter Mau, Alex Acero, Li Deng, "Unsupervised Learning from Users' Error Correction in Speech Dictation", in Proceedings of InterSpeech-ICSLP 2004, October 4-8, 2004, Jeju, Korea.
  27. Li Deng, Dong Yu, and Alex Acero, "A Quantitative Model for Formant Dynamics and Contextually Assimilated Reduction in Fluent Speech", in Proceedings of InterSpeech-ICSLP 2004.
  28. Dong Yu, Deborah Frincke, "A Novel Framework for Alert Correlation and Understanding" (© Springer-Verlag), Springer's LNCS series, vol 3089. International Conference on Applied Cryptography and Network Security (ACNS) 2004.
  29. Dong Yu, Deborah Frincke, "Towards Survivable Intrusion Detection System", the 37th Hawaii International Conference On System Science (HICSS-37), Big Island, Hawaii, 2004.
  30. Dong Yu, Kuansan Wang, Milind Mahajan, Peter Mau, Alex Acero, "Improved Name Recognition With User Modeling", in Proceedings of EUROSPEECH 03, Geneva, Switzerland, 2003.
  31. Lei Yao, Dong Yu, and Taiyi Huang, "A unified spectral transformation adaptation approach for robust speech recognition", in Proceedings of the fourth International Conference in Spoken Language Proceedings (ICSLP-96), Philadelphia, PA, USA, 1996.
  32. Dong Yu and Taiyi Huang, "Canonical Correlation Based Compensation Approach for Robust Speech Recognition in Noisy Environment", in Proceedings of EUROSPEECH 95, pp477-480, Madrid, Spain, 1995.
  33. Dong Yu and Taiyi Huang, "A New HMM/NN Hybrid Method for High Performance Speech Recognition", in Proceedings of International Conference in Spoken Language Processing (ICSLP-94), Yokohama, Japan, 1994.
  34. Dong Yu and Taiyi Huang, "A New Time-alignment Approach for Robust Neural Network Based Speech Recognition", in Proceedings of IEEE International Conference in Signal Processing (ICSP-93), Beijiang, China, 1993.

Other Conferences and/or Short Papers:

  1. Geoffrey Zweig, Y.C. Ju, Patrick Nguyen, Dong Yu, Ye-Yi Wang, Alex Acero, "Voice-Rate: A Dialog System for Consumer Ratings", NAACL-HLT 2007, Rochester, New York, USA, pp31-32.
  2. Li Deng, Xiang Li, Dong Yu, and Alex Acero, "Novel Acoustic Modeling with Structured Hidden Dynamics for Speech Coarticulation and Reduction", in Proc. of the DARPA RT04 Workshop. Palisades, New York, Nov 2004.

Chinese Publications

  1. Dong Yu, Taiyi Huang, and Daowen Chen, "Chinese Consonant Recognition Based on Multi-State Gaussian Competitive Neural Networks", in Proceedings of Chinese Conference on Neural Networks (CCNN), Wuhan, China, 1994 (in Chinese).
  2. Dong Yu, Taiyi Huang, and Daowen Chen, "A Fast NN/HMM Hybrid Approach for Speech Recognition", in Proceedings of Chinese Conference on Man-Machine Interface (CCMMI), China, 1994 (in Chinese).

Patents (US and/or International)

  1. "Speech Recognition With Non-linear Noise Reduction on Mel-Frequency Cepstra", with Li Deng, Jasha Droppo, and Alex Acero, (pending, filed 2008)
  2. "High Performance HMM Adaptation With Joint Compensation of Additive and Convolutive Distortions", with Jinyu Li, Li Deng, Alex Acero (pending, filed 2007)
  3. "Searching Database of Listing", with Ye-Yi Wang, Yun-Cheng Ju, Alex Acero and Geoffrey Zweig (pending, filed 2007)
  4. "Speech-Centric Multimodal User Interface Design in Mobile Technology", with Li Deng (pending, filed 2007)
  5. "A Generic Framework for Large-Margin MCE Training in Speech Recognition", with Li Deng, Xiaodong He, Alex Acero (pending, filed 2007)
  6. "Integrated Speech Recognition and Semantic Classification", with Sibel Yaman, Li Deng, Ye-Yi Wang, and Alex Acero (pending, filed 2007)
  7. "Hidden Trajectory Modeling with Differential Cepstra for Speech Recognition", with Li Deng (pending, filed 2007)
  8. "Indexing and Ranking Processes for Directory Assistance Services", with Yun Cheng Ju, Ye-yi Wang, and Alex Acero (pending, filed 2007)
  9. "Adapting a Language Model to Accommodate Inputs Not Found in a Directory Assistance Listing", with Yun Cheng Ju and Alex Acero (pending, filed 2007)
  10. "Compound Word Splitting for Directory Assistance Services", with Yun Cheng Ju and Alex Acero (pending, filed 2007)
  11. "Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition", with Li Deng, Xiaodong He, Alex Acero (pending, filed 2006)
  12. "Detecting an Answering Machine Using Speech Recognition", with Yun Cheng Ju, Alex Acero, Craig M. Fisher, and Ye-yi Wang (pending, filed 2006)
  13. "Sharable Filler Model for Grammar Authoring", with Yun Cheng Ju, Alex Acero, and Ye-Yi Wang (pending, filed 2006)
  14. "Time Synchronous Decoding for Long-Span Hidden Trajectory Model", with Xiaolong Li, Li Deng, and Alex Acero (pending, filed 2006)
  15. "Parameter Learning in a Hidden Trajectory Model", with Li Deng, Xiaolong Li, Alex Acero (pending, filed 2006)
  16. "Time Asynchronous Decoding for Long-Span Trajectory Model", with Li Deng, Alex Acero (pending, filed 2005)
  17. "Learning Statistically Characterized Resonance Targets in a Hidden Trajectory Model", with Li Deng, Alex Acero (pending, filed 2005)
  18. "Speaker-adaptive Learning of Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation", with Li Deng, Alex Acero (pending, filed 2005)
  19. "Configurable Grammar Templates", with Ye-Yi Wang, Yun-Cheng Ju, and Alex Acero (pending, filed 2005)
  20. "Interactive Clustering Method for Identifying Problems in Speech Applications", with Alex Acero (pending, filed 2005)
  21. "Classification Filter for Processing Data for Creating a Language Model", with Alex Acero, Julian J. Odell, Milind V. Mahajan, and Peter Mau (pending, filed 2005)
  22. "Method of Automatically Ranking Speech Dialog States and Transitions to Aid in Performance Analysis in Speech Applications", with Alex Acero (pending, filed 2005)
  23. "System and Method for Identifying Semantic Intent from Acoustic Information", with Xiao Li, Asela J. Gunawardana, Alex Acero, and Milind Mahajan (pending, filed 2005)
  24. "Acoustic Models with Structured Hidden Dynamics with Integration over Many Possible Hidden Trajectories", with Li Deng, and Alex Acero (pending, filed 2004)
  25. "Two Stage Implementation for Phonetic Recognition Using a Bi-directional Target-filtering Model of Speech Co-articulation and Reduction", with Li Deng, and Alex Acero (pending, filed 2004)
  26. "Quantitative Model for Formant Dynamics and Contextually Assimilated Reduction in Fluent Speech", with Li Deng, and Alex Acero (pending, filed 2004)
  27. "Automatic Speech Recognition Learning Using User Corrections", With Peter Mau, Mei-Yuh Hwang , and Alex Acero (pending, filed 2004)
  28. "Efficient Capitalization through User Modeling", with Peter Mau (pending, filed 2003)
  29. "Named entity recognition with user modeling", with Peter Mau, Kuansan Wang, Milind Mahajan, and Alex Acero, (granted 2007, US patent #7289956)

Technical Services

  • Grant reviewer: Research Grants Council (RGC) of Hong Kong
  • Technical program committee member: ICSC 2007, ICSC 2008, ISCSLP 2008
  • Organization committee member: MMSP 2006, ICSC 2008
  • Session chair: ACNS 2004, ICSC 2007.
  • Paper reviewer: J. Computer Speech and Language, J. Speech Communication, IEEE Transactions on Audio, Speech, and Language Processing, J. Pattern Recognition Letters, EURASIP J. on Audio Speech and Music Processing, J. Computer Security, J. Computer Networks, IEEE Transactions on Computer, ICASSP 2004-2008, INTERSPEECH 2004-2007, ICSC 2007-2008, EUSIPCO 2008, ACMSE 2005

Honors and Awards

  • Best Paper Award of ACMSE, 2005
  • Gold Star Award from Microsoft, 2004
  • Patent Awards from Microsoft, 2002-2008
  • Graduate school fellowship from Indiana University, 1995-96
  • Presidential Award of Chinese Academy of Sciences, 1994
  • Excellent Graduate Student award from Chinese Academy of Sciences, 1993
  • Excellent Graduate Student award from Graduate School, Academia Sinica, 1992
  • Excellent Graduate award from Zhejiang province, 1991
  • Excellent Graduate award from Zhejiang University, 1991
  • Excellent Student Award from Zhejiang University, 1987-1991

©2008 Microsoft Corporation. All rights reserved. Terms of Use |Trademarks |Privacy Statement