|
|
|
Dong Yu
Dong Yu (俞栋) - Researcher,
Speech
Research Group
About Me | Speech Research Links | Photo Albums | Contact Me
About Me
Brief Biography
Dr. Dong Yu joined Microsoft in 1998 and Microsoft
Speech Research Group in 2002, where he is currently
a researcher. He holds a Ph.D. degree in computer
science from University of Idaho, an
MS degree in computer science from
Indiana University / Bloomington, an MS degree in electrical
engineering from Chinese Academy of
Sciences, and a BS degree (with
honor) in electrical engineering from
Zhejiang University (China). His
current research interests include speech processing, robust speech
recognition, discriminative training, spoken dialog system, voice search
technology, machine learning, and pattern recognition. He is a senior member of
IEEE, a member of ACM,
and a member of ISCA.
Equation Number in Office 2007
Office 2007 comes with a very nice equation editor and bibliography manager.
However, it does not support equation and theorem number management. To work around this problem.
I have developed a set of macros. You can download it here.
English Publications
All copyrights for these documents are retained by the copyright
holder, and permission to copy the work should be obtained from the
copyright holder in writing. This copyright notice must be kept together
with the downloaded or printed document.
Book Chapters
- Dong Yu, Li Deng, "Speech-Centric Multimodal User Interface Design
in Mobile Technology", Handbook of Research on User Interface Design
and Evaluation for Mobile Technology (Editor: Joanna Lumsden), Jan. 2008, IGI. (to appear)
Journals and Magazines
- Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex Acero, "A Discriminative Technique for Spoken Utterance Classification", IEEE Transactions on Audio, Speech and Language Processing. (to appear)
- Dong Yu, Li Deng, Xiaodong He, Alex Acero, "Large-Margin
Minimum Classification Error Training: A Theoretical Risk Minimization
Perspective", Computer Speech and Language, Volume 22, Issue 4, October
2008, Pages 415-429.
- Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, Alex Acero, "Robust Speech Recognition Using a Cepstral Minimum-Mean-Square-Error-Motivated Noise Suppressor", IEEE Transactions on Audio, Speech and Language Processing. (to appear)
- Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Alex Acero, "Voice Search – An Introduction", IEEE Signal Processing Magazine
(Special Issue on Spoken Language Technology), May 2008 (to appear)
- Dong Yu, Li Deng, Alex Acero, "A
Lattice Search Technique for a Long-Contextual-Span Hidden Trajectory Model
of Speech", Speech Communication, Elsevier. Volume: 48 Issue: 9, Sep
2006. pp. 1214-1226.
- Li Deng, Dong Yu, and Alex Acero. "Structured
Speech Modeling", IEEE Trans. on Audio, Speech and Language
Processing. Volume: 14 Issue: 5, Sep 2006. pp. 1492- 1504.
- Dong Yu, Deborah Frincke, "Improving
the Quality of Alerts and Predicting Intruder¡¯s Next Goal with Hidden
Colored Petri-Net", Computer Networks, Volume 51, Issue 3, 21 February
2007, Pages 632-654.
- Dong Yu, Li Deng, Alex Acero, "Speaker-Adaptive Learning
of Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation",
Computer Speech and Language, Vol. 27, 2007, pp. 72-87.
- Li Deng, Dong Yu, and Alex Acero, "A Bi-Directional
Target-Filtering Model of Speech Coarticulation and Reduction:
Two-Stage Implementation for Phonetic Recognition", IEEE Trans.
Audio, Speech & Language Proc, vol. 14, No. 1, pp 256-265, Jan 2006.
- Dong Yu, Alex Acero, "Semiautomatic
Improvements of System-Initiative Spoken Dialog Applications Using
Interactive Clustering", © IEEE Trans. Speech & Audio
Proc (Special Issue on Data Mining of Speech, Audio and Dialog),
Sept. 2005, vol.13, no. 5pp 661-671.
- Li Deng, Dong Yu, "A Speech-Centric Perspective for
Human-Computer Interface - A Case Study", Journal of VLSI Signal
Processing Systems (Special Issue on Multimedia Signal Processing),
Vol. 41, No. 3. pp. 255-269, November 2005.
Refereed Conferences
- Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan
Gong, Alex Acero, "Adaptation
of Compressed HMM Parameters for Resource-Constrained Speech Recognition",
ICASSP 2008, Las Vegas, USA.
- Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero, "HMM
Adaptation Using a Phase-Sensitive Acoustic Distortion Model For
Environment-Robust Speech Recognition", ICASSP 2008, Las Vegas, USA.
- Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, Alex Acero, "a Minimum-Mean-Square-Error Noise Reduction Algorithm on Mel-Frequency Cepstra
for Robust Speech Recognition", ICASSP 2008, Las Vegas, USA.
- Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero, "High-Performance HMM Adaptation With Joint Compensation of Additive and Convolutive Distortions Via Vector Taylor Series", ASRU 2007, Kyoto, Japan.
- Ivan Tashev, Michael Seltzer, Yun-Cheng Ju, Dong Yu and Alex Acero, "Commute UX: Telephone Dialog System for Location-based Services", SIGDIAL 2007, Antwerp, Belgium.
- Dong Yu, Li Deng, "Large-Margin Discriminative Training of Hidden Markov Models for Speech Recognition",
ICSC 2007, Irvine, CA (invited).
- Geoffrey Zweig, Patrick Nguyen, Yun-Cheng Ju, Ye-Yi Wang, Dong Yu, Alex Acero,
"The Voice-Rate Dialog System for Consumer Ratings", Interspeech 2007, Antwerp, Belgium.
- Dong Yu, Li Deng, Alex Acero, "Handling Phonetic Context and
Speaker Variation in a Structure-Based Speech Recognizer", Interspeech 2007, Antwerp, Belgium.
- J. Sherwani, Dong Yu, Tim, Paek, Mary Czerwinski, Yun-Cheng Ju, Alex Acero,
"VoicePedia: Towards Speech-based Access to Unstructured
Information", Interspeech 2007, Antwerp, Belgium.
- Dong Yu, Yun-Cheng Ju, Ye-Yi Wang, Geoffrey Zweig, Alex Acero,
"Automated Directory Assistance System - from Theory to Practice", Interspeech 2007, Antwerp, Belgium.
- Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Geoffrey Zweig, Alex Acero, "Confidence Measures for Voice Search
Applications", Interspeech 2007, Antwerp, Belgium.
- Dong Yu, Li Deng, Xiaodong He, Alex Acero, "Large-Margin Minimum
Classification Error Training for Large-Scale Speech Recognition Tasks",
ICASSP 2007, Hawaii, USA.
- Li Deng, Dong Yu, "Use of Differential Cepstra as Acoustic
Features in Hidden Trajectory Modeling for Phonetic Recognition", ICASSP 2007,
Hawaii, USA.
- Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alex Acero, "A
Discriminative Training Framework Using N-Best Speech Recognition
Transcriptions and Scores for Spoken Utterance Classification", ICASSP 2007,
Hawaii, USA.
- Dong Yu, Li Deng, Xiaodong He, Alex Acero, "Use of
Incrementally Regulated Discriminative Margins in MCE Training for
Speech Recognition", in Proc. of the Interspeech Conference. Pittsburgh,
Sep, 2006.
- Xiaolong Li, Li Deng, Dong Yu, Alex Acero, "A Time-Synchronous Phonetic Decoder for a Long-Contextual-Span Hidden Trajactory Model",
Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
- Dong Yu, Yun Cheng Ju, Alex Acero, "Effective and
Efficient Utterance Verification Technology Using Word N-gram Filler
Models", in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
- Dong Yu, Yun Cheng Ju, Ye-Yi Wang, Alex Acero, "N-Gram
Based Filler Model for Robust Grammar Authoring",in Proc. ICASSP
2006, Toulouse, France, May 2006.
- Li Deng, Dong Yu, and Alex Acero. "A Generative Modeling
Framework for Structured Hidden Speech Dynamics", in Proc. NIPS
Workshop on Advances in Structured Learning for Text and Speech
Processing 2005.
- Li Deng, Dong Yu, and Alex Acero. "A
Long-Contextual-Span Model of Resonance Dynamics for Speech
Recognition: Parameter Learning and Recognizer Evaluation,"
in Proc. ASRU2005.
- Dong Yu, Li Deng, and Alex Acero. "Learning
Statistically Characterized Resonance Targets in a Hidden Trajectory
Model of Speech Coarticulation and Reduction," in Proc of
Interspeech, Lisbon, Sept 2005.
- Li Deng, Dong Yu, and Alex Acero."Evaluation of a Long-contextual-span Hidden Trajectory Model and Phonetic Recognizer Using A* Lattice Search," in Proc of
Interspeech, Lisbon, Sept 2005.
- Dong Yu, Deborah Frincke, "Alert
Confidence Fusion in Intrusion Detection Systems with Extended
Dempster-Shafer Theory" © ACM, in the 43rd Annual ACM Southeast
Conference, Kennesaw, Georgia, March 18, 19 & 20, 2005.
- Dong Yu, Milind Mahajan, Peter Mau, and Alex Acero, "Maximum
Entropy Based Generic Filter for Language Model Adaptation",
in ICASSP 05, March 19-23, 2005, Philadelphia, PA, USA
- Li Deng, Xiang Li, Dong Yu, and Alex Acero, "A
Hidden Trajectory Model with Bi-directional Target-Filtering:
Cascaded vs. Integrated Implementation for Phonetic Recognition",
in ICASSP 05, March 19-23, 2005, Philadelphia, PA, USA
- Dong Yu, Mei-Yuh Hwang , Peter Mau, Alex Acero, Li Deng,
"Unsupervised
Learning from Users' Error Correction in Speech Dictation",
in Proceedings of InterSpeech-ICSLP 2004, October 4-8, 2004, Jeju,
Korea.
- Li Deng, Dong Yu, and Alex Acero, "A
Quantitative Model for Formant Dynamics and Contextually Assimilated
Reduction in Fluent Speech", in Proceedings of
InterSpeech-ICSLP 2004.
- Dong Yu, Deborah Frincke, "A
Novel Framework for Alert Correlation and Understanding"
(© Springer-Verlag), Springer's LNCS series, vol 3089. International
Conference on Applied Cryptography and Network Security (ACNS) 2004.
- Dong Yu, Deborah Frincke, "Towards
Survivable Intrusion Detection System", the 37th Hawaii
International Conference On System Science (HICSS-37), Big Island,
Hawaii, 2004.
- Dong Yu, Kuansan Wang, Milind Mahajan, Peter Mau, Alex
Acero, "Improved
Name Recognition With User Modeling", in Proceedings of
EUROSPEECH 03, Geneva, Switzerland, 2003.
- Lei Yao, Dong Yu, and Taiyi Huang, "A
unified spectral transformation adaptation approach for robust
speech recognition", in Proceedings of the fourth
International Conference in Spoken Language Proceedings (ICSLP-96),
Philadelphia, PA, USA, 1996.
- Dong Yu and Taiyi Huang, "Canonical
Correlation Based Compensation Approach for Robust Speech
Recognition in Noisy Environment", in Proceedings of
EUROSPEECH 95, pp477-480, Madrid, Spain, 1995.
- Dong Yu and Taiyi Huang, "A New HMM/NN Hybrid Method for
High Performance Speech Recognition", in Proceedings of
International Conference in Spoken Language Processing (ICSLP-94),
Yokohama, Japan, 1994.
- Dong Yu and Taiyi Huang, "A New Time-alignment Approach
for Robust Neural Network Based Speech Recognition", in Proceedings
of IEEE International Conference in Signal Processing (ICSP-93),
Beijiang, China, 1993.
Other Conferences and/or Short Papers:
- Geoffrey Zweig, Y.C. Ju, Patrick Nguyen, Dong Yu, Ye-Yi Wang, Alex Acero, "Voice-Rate: A Dialog System for Consumer Ratings", NAACL-HLT 2007, Rochester, New York, USA, pp31-32.
- Li Deng, Xiang Li, Dong Yu, and Alex Acero, "Novel
Acoustic Modeling with Structured Hidden Dynamics for Speech
Coarticulation and Reduction", in Proc. of the DARPA RT04
Workshop. Palisades, New York, Nov 2004.
Chinese Publications
- Dong Yu, Taiyi Huang, and Daowen Chen, "Chinese Consonant
Recognition Based on Multi-State Gaussian Competitive Neural Networks", in
Proceedings of Chinese Conference on Neural Networks (CCNN), Wuhan, China, 1994
(in Chinese).
- Dong Yu, Taiyi Huang, and Daowen Chen, "A Fast NN/HMM Hybrid
Approach for Speech Recognition", in Proceedings of Chinese Conference on
Man-Machine Interface (CCMMI), China, 1994 (in Chinese).
Patents (US and/or International)
- "Speech Recognition With Non-linear Noise Reduction on Mel-Frequency Cepstra", with Li Deng, Jasha Droppo, and Alex Acero, (pending, filed 2008)
- "High Performance HMM Adaptation With Joint Compensation of Additive and Convolutive Distortions",
with Jinyu Li, Li Deng, Alex Acero (pending, filed 2007)
- "Searching Database of Listing", with Ye-Yi
Wang, Yun-Cheng Ju, Alex Acero and Geoffrey Zweig (pending, filed 2007)
- "Speech-Centric Multimodal User Interface
Design in Mobile Technology", with Li Deng (pending, filed 2007)
- "A Generic Framework for Large-Margin MCE Training
in Speech Recognition", with Li Deng, Xiaodong
He, Alex Acero (pending, filed 2007)
- "Integrated Speech Recognition and Semantic
Classification", with Sibel Yaman, Li Deng, Ye-Yi Wang, and Alex Acero
(pending, filed 2007)
- "Hidden Trajectory Modeling with Differential
Cepstra for Speech Recognition", with Li Deng (pending, filed 2007)
- "Indexing and Ranking Processes for Directory
Assistance Services", with Yun Cheng Ju, Ye-yi Wang, and Alex Acero (pending, filed 2007)
- "Adapting a Language Model to Accommodate Inputs Not Found
in a Directory Assistance Listing", with Yun Cheng Ju and Alex Acero (pending, filed 2007)
- "Compound Word Splitting for Directory Assistance Services",
with Yun Cheng Ju and Alex Acero (pending, filed 2007)
- "Incrementally Regulated Discriminative
Margins in MCE Training for Speech Recognition", with Li Deng, Xiaodong
He, Alex Acero (pending, filed 2006)
- "Detecting an Answering Machine Using
Speech Recognition", with Yun Cheng Ju, Alex Acero, Craig M. Fisher,
and Ye-yi Wang (pending, filed 2006)
- "Sharable Filler Model for Grammar
Authoring", with Yun Cheng Ju, Alex Acero, and Ye-Yi Wang (pending,
filed 2006)
- "Time Synchronous Decoding for Long-Span
Hidden Trajectory Model", with Xiaolong Li, Li Deng, and Alex Acero
(pending, filed 2006)
- "Parameter Learning in a Hidden Trajectory
Model", with Li Deng, Xiaolong Li, Alex Acero (pending, filed 2006)
- "Time Asynchronous Decoding for Long-Span
Trajectory Model", with Li Deng, Alex Acero (pending, filed 2005)
- "Learning Statistically Characterized
Resonance Targets in a Hidden Trajectory Model", with Li Deng, Alex
Acero (pending, filed 2005)
- "Speaker-adaptive Learning of Resonance
Targets in a Hidden Trajectory Model of Speech Coarticulation", with
Li Deng, Alex Acero (pending, filed 2005)
- "Configurable Grammar Templates", with
Ye-Yi Wang, Yun-Cheng Ju, and Alex Acero (pending, filed 2005)
- "Interactive Clustering Method for
Identifying Problems in Speech Applications", with Alex Acero
(pending, filed 2005)
- "Classification Filter for Processing Data
for Creating a Language Model", with Alex Acero, Julian J. Odell,
Milind V. Mahajan, and Peter Mau (pending, filed 2005)
- "Method of Automatically Ranking Speech
Dialog States and Transitions to Aid in Performance Analysis in
Speech Applications", with Alex Acero (pending, filed 2005)
- "System and Method for Identifying Semantic
Intent from Acoustic Information", with Xiao Li, Asela J.
Gunawardana, Alex Acero, and Milind Mahajan (pending, filed 2005)
- "Acoustic Models with Structured Hidden
Dynamics with Integration over Many Possible Hidden Trajectories",
with Li Deng, and Alex Acero (pending, filed 2004)
- "Two Stage Implementation for Phonetic
Recognition Using a Bi-directional Target-filtering Model of Speech
Co-articulation and Reduction", with Li Deng, and Alex Acero
(pending, filed 2004)
- "Quantitative Model for Formant Dynamics
and Contextually Assimilated Reduction in Fluent Speech", with Li
Deng, and Alex Acero (pending, filed 2004)
- "Automatic Speech Recognition Learning
Using User Corrections", With Peter Mau, Mei-Yuh Hwang , and Alex
Acero (pending, filed 2004)
- "Efficient Capitalization through User
Modeling", with Peter Mau (pending, filed 2003)
- "Named entity recognition with user
modeling", with Peter Mau, Kuansan Wang, Milind Mahajan, and Alex
Acero, (granted 2007, US patent #7289956)
Technical Services
-
Grant reviewer: Research Grants Council (RGC) of Hong Kong
-
Technical program committee member: ICSC 2007, ICSC 2008,
ISCSLP 2008
-
Organization committee member: MMSP 2006, ICSC 2008
-
Session chair: ACNS 2004, ICSC 2007.
-
Paper
reviewer: J. Computer Speech
and Language, J. Speech
Communication, IEEE Transactions on Audio, Speech, and Language Processing, J. Pattern
Recognition Letters, EURASIP J. on Audio Speech and Music Processing, J. Computer
Security, J. Computer
Networks, IEEE Transactions on Computer, ICASSP 2004-2008, INTERSPEECH 2004-2007,
ICSC 2007-2008, EUSIPCO 2008, ACMSE
2005
Honors and Awards
-
Best Paper Award of ACMSE, 2005
-
Gold Star Award from Microsoft, 2004
-
Patent Awards from Microsoft, 2002-2008
-
Graduate school fellowship from
Indiana University, 1995-96
-
Presidential Award of Chinese
Academy of Sciences, 1994
-
Excellent Graduate Student award
from Chinese Academy of Sciences, 1993
-
Excellent Graduate Student award
from Graduate School, Academia
Sinica, 1992
-
Excellent Graduate award from
Zhejiang province, 1991
-
Excellent Graduate award from
Zhejiang University, 1991
-
Excellent Student Award from
Zhejiang University, 1987-1991
|
|