Share this page
Share this page E-mail this page Print this page RSS feeds
Home > People > Li Deng
Li Deng

PRINCIPAL RESEARCHER
.

Brief Biography

Li Deng received the Bachelor degree from the University of Science and Technology of China (with the Guo Mo-Ruo Award), and received the Ph.D. degree from the University of Wisconsin-Madison (with the Jerzy E. Rose Award). In 1989, he joined Dept. Electrical and Computer Engineering, University of Waterloo, Ontario, Canada as an Assistant Professor, where he became a Full Professor in 1996. From 1992 to 1993, he conducted sabbatical research at Laboratory for Computer Science, Massachusetts Institute of Technology, Cambridge, Mass, and from 1997-1998, at ATR Interpreting Telecommunications Research Laboratories, Kyoto, Japan. In 1999, he joined Microsoft Research, Redmond, WA as a Senior Researcher, where he is currently a Principal Researcher. He is also an Affiliate Professor in the Department of Electrical Engineering at University of Washington, Seattle. His past and current research activities include automatic speech and speaker recognition, statistical methods and machine learning, neural information processing, machine intelligence, audio and acoustic signal processing, statistical signal processing and digital communication, human speech production and perception, acoustic phonetics, auditory speech processing, auditory physiology and modeling, noise robust speech processing, speech synthesis and enhancement, spoken language understanding systems, multimedia signal processing, and multimodal human-computer interaction. In these areas, he has published over 300 refereed papers in leading international conferences and journals, 12 book chapters, and has given keynotes, tutorials, and lectures worldwide. He has been granted over 20 US or international patents in acoustics, speech/language technology, and signal processing. He authored or co-authored three books in speech processing and learning. He serves on the Board of Governors of the IEEE Signal Processing Society, and as Editor-in-Chief for the IEEE Signal Processing Magazine. He is a Fellow of the Acoustical Society of America, and a Fellow of the IEEE.

Education

  • B.S.: University of Science and Technology of China (USTC).
  • Master: University of Wisconsin-Madison, U.S.A.
  • Ph.D.: University of Wisconsin-Madison, U.S.A.
Publications: Books
Book Chapters
Journal/Magazine Publications

    2009

    2008

    2007

    2006

    2005

    2004

    2003

    2002

    2001

    2000

    1999

    1998

    1997

    1996

    1995

    1994

    1993

    1992

    1991

    1990

    1989

    1988

    1987

    1986

    1985

    Conference Publications

      2009

      2008

      2007

      2006

      2005

      2004

      2003

      2002

      2001

      2000

      • J. Sun and L. Deng. "Annotation and use of speech production corpus for building language-universal speech recognizers", Proceedings of the 2nd International Symposium on Chinese Spoken Language Processing (ISCSLP), Beijing, October 2000, Vol. 3, pp. 31-34.
      • J. Sun, R. Tongneri and L. Deng. "A robust speech understanding system using conceptual relational grammar," Proceedings of the International Conference on Spoken Language Processing,October 2000, Vol. 2, pp. 879-882.
      • S. Dusan and L. Deng. "Acoustic-to-articulatory inversion using dynamical and phonological constraints" Proceedings of the 5th Speech Production Workshop: MODELS AND DATA, Kloster Seeon, Germany, May 1-4, 2000, pp. 237-240.
      • M. Naito, L. Deng, and Y. Sagisaka. "Speaker clustering for speech recognition using the parameters characterizing vocal tract dimensions," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Seattle, WA, May 11-15, 1998.
      • M. Naito, L. Deng, and Y. Sagisaka. "Speaker adaptation methods using vocal tract parameters," (in Japanese) Proceedings of the 1998 Spring Meeting of the Acoustical Society of Japan, Yokohama, Japan, March 17-19, 1998, pp. 55-56.
      • M. Naito, L. Deng, and Y. Sagisaka. "A study on speaker clustering methods using vocal tract parameters," (in Japanese) Proceedings of Japan Institute of Electronics, Information, and Communication Engineers (IEICE), Yokosuka, Japan, December 1997, Vol. 97, No. 441, pp. 35-40.
      • L. Deng (invited). "A dynamic, feature-based approach to speech modeling and recognition," Proceedings of the 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barbara, CA, December 14-17, 1997, pp. 107-114.
      • C. Rathinavelu and L. Deng. "Speech adaptation experiments using nonstationary-state HMMs: A MAP approach," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, 1997, Vol. 2, pp. 1415-1418.
      • L. Deng. "Integrated-multilingual speech recognition using universal phonological features in a functional speech production model," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, 1997, Vol. 2, pp. 1007-1010.
      • C. Rathinavelu and L. Deng. "On the use of discriminatively derived feature space transformation in speech recognition," Proceedings of the International Conference on Signal Processing Applications and Technology, Boston, MA, October 7-10, 1996, pp. 1769-1773.
      • C. Rathinavelu and L. Deng. "Trended HMM with discriminative training for phonetic classification," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 1049-1052.
      • X. Shen, L. Deng, and A. Yasmin. "H-infinity filtering for speech enhancement," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 873-876.
      • L. Deng, X. Shen, and D. Jamieson. "Simulation of disordered speech using a frequency-domain vocal tract model," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 768-771.
      • D. Jamieson, L. Deng, M. Price, V. Parsa, and J. Till. "Interactions of speech disorders with speech coders: Effects on speech intelligibility," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 737-740.
      • G. Ramsay and L. Deng. "Optimal filtering and smoothing for speech recognition using a stochastic target model," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 1113-1116.
      • L. Deng and J. Wu. "Hierarchical partitioning of articulatory state space for articulatory-feature based speech recognition," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 2266-2269.
      • J. Wu and L. Deng. "Acoustic Modeling for Continuous Mandarin-Chinese Speech Recognition," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 2281-2284.
      • L. Deng, G. Ramsay, and D. Sun. (invited). "Production models as a structural basis for automatic speech recognition," Proceedings of the Fourth European Speech Production Workshop, Autrans, France, May 24-27, 1996, pp. 69--80.
      • L. Deng. "Finite-state automata derived from overlapping articulatory features: A novel phonological construct for speech recognition," Proceedings of the Workshop on Computational Phonology in Speech Technology, (published by Association for Computational Linguistics), Santa Cruz, CA, June 28, 1996. pp. 37-45.
      • L. Deng and H. Sheikhzadeh. "Temporal and rate aspects of speech encoding in the auditory system: Simulation results on TIMIT data using a layered neural network interfaced with a cochlear model," Proceedings of European Speech Communication Association Tutorial and Research Workshop on the Auditory Basis of Speech Recognition, July 15 - 19, 1996, Keele University, United Kingdom, pp. 75-78.
      • C. Rathinavelu and L. Deng. "HMM-based speech recognition using state-dependent, discriminatively derived transforms on Mel-warped DFT features", Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.1, Atlanta, Georgia, May 7-10, 1996, pp. 9--12.
      • L. Deng, G. Ramsay, and H. Sameti. "From modeling surface phenomena to modeling mechanisms: Towards a faithful model of the speech process aiming at speech recognition," Proceedings of the 1995 IEEE Workshop on Automatic Speech Recognition, December 10-13, 1995, Snowbird, Utah, pp. 183-184.
      • G. Ramsay and L. Deng. "Maximum-likelihood estimation for articulatory speech recognition using a stochastic target model," Proceedings of the 1995 European Conference on Speech Communication and Technology, Spain, September 18-21, 1995, pp. 1401-1404.
      • G. Ramsay and L. Deng. "Modal analysis of acoustic wave propagation in the vocal tract using a finite-difference method," Proceedings of the XII International Congress of Phonetic Sciences, Stockholm, Sweden, August 13-19, 1995, Vol 2, pp. 338-341.
      • G. Ramsay and L. Deng. "Articulatory synthesis using a stochastic target model of speech production," Proceedings of the XII International Congress of Phonetic Sciences, Stockholm, Sweden, August 13-19, 1995, Vol 2, pp. 478-481.
      • L. Deng, J. Wu, and H. Sameti. "Improved speech modeling and recognition using multi-dimensional articulatory states as primitive speech units," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, May 8-12, 1995, pp. 385-388.
      • D. Sun and L. Deng. "Analysis of acoustic-phonetic variations in fluent speech using TIMIT," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, May 8-12, 1995, pp. 201-204.
      • C. Rathinavelu and L. Deng. "Use of generalized dynamic feature parameters for speech recognition: Maximum likelihood and minimum classification error approaches," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, May 8-12, 1995, pp. 373-376.
      • S. Shen and L. Deng. "Discrete H-infinity filtering design with application to speech enhance ment," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, May 8-12, 1995, pp. 1504-1507.
      • H. Sheikhzadeh, R. Brennan, L. Deng, and H. Sameti, "Real-time implementation of HMM-based MMSE algorithm for speech enhancement in hearing aid applications," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 1995 ,
      • D. Sun and L. Deng. "Nonstationary-state hidden Markov model with state-dependent time warping: Application to speech recognition," Proceedings of the 1994 International Conference on Spoken Language Processing, Vol. 1, Yokohama, Japan, September, 18-22, 1994. pp. 243--246,
      • L. Deng and H. Sameti. "Speech recognition using dynamically defined speech units," Proceedings of the 1994 International Conference on Spoken Language Processing, Vol. 4, pp. 2167-2170, Yokohama, Japan, September, 18-22, 1994.
      • H. Sheikhzadeh and L. Deng. "Interval statistics from a cochlear model in response to speech sounds," Journal of the Acoustical Society of America, Vol. 95, No. 6, June 1994 (Abstract), pp. 2842. (The 127th Meeting of the Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
      • L. Deng and I. Kheirallah. "Stability analysis on finite-difference solution of a basilar-membrane vibration model with application to acoustic signal processing," Journal of the Acoustical Society of America, Vol. 95, No. 6, June 1994 (Abstract), pp. 2840. (The 127th Meeting of the Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
      • L. Deng and H. Sameti. "Articulatory phonology and speech recognition: A study on use of dynamically defined speech primitives," Journal of the Acoustical Society of America, Vol. 95, No. 6, June 1994 (Abstract), pp. 2870. (The 127th Meeting of the Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
      • G. Ramsay and L. Deng. "A stochastic framework for articulatory speech recognition," Journal of the Acoustical Society of America, Vol. 95, No. 6, June 1994 (Abstract), pp. 2871. (The 127th Meeting of the Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
      • K. Hassanein, L. Deng and M. Elmasry. "A neural predictive hidden Markov model for speaker recognition," Proceedings of the Workshop on Automatic Speaker Recognition, Identification and Verification, Martigny, Switzerland, April, 1994, pp. 115-118.
      • L. Deng and M. Aksmanovic. "HMMs with mixtures of trended functions for automatic speech recognition," IEEE International Conference on Speech, Image Processing and Neural Networks, April 13-15, 1994, HongKong, pp. 702-705.
      • L. Deng. "A theory on optimal construction of dynamic features for hidden Markov modeling of speech," IEEE International Conference on Speech, Image Processing and Neural Networks, April 13-15, 1994, HongKong, pp. 351-354.
      • L. Deng and D. Sun. "Phonetic classification and recognition using HMM representation of overlapping articulatory features for all classes of English sounds," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Adelaide, Australia, April 19-22, 1994, Vol. 1, pp. 45-48.
      • K. Hassanein, L. Deng and M. Elmasry. "Vowel classification using a neural predictive HMM: A discriminative training approach," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Adelaide, Australia, April 19-22, 1994, Vol 2, pp. 665-668.
      • H. Sameti, H. Sheikhzadeh, L. Deng and R. Brennan. "Comparative performance of spectral subtraction and HMM-based speech enhancement strategies with application to hearing aid design." Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Adelaide, Australia, April 19-22, 1994, Vol. 1, pp. 13-16.
      • L. Deng. "A computational model of phonology-phonetics integration for automatic speech recognition," Proceedings of the 1993 IEEE Workshop on Automatic Speech Recognition, December 12-15, 1993, Snowbird, Utah, pp. 83--84.
      • K. Hassanein, L. Deng and M. Elmasry. "A neural predictive hidden Markov model for speech and speaker recognition," Proceedings of the Fifth International Conference on Microelectronics December 14-16, 1993, Dhahran, Saudi Arabia, pp. 108-111.
      • L. Deng and D. Sun. "Speech recognition using the atomic speech units constructed from overlapping articulatory features," Proceedings of the 1993 European Conference on Speech Communication and Technology, September 21-23, 1993, Berlin, Germany, Vol. III, pp. 1635--1638.
      • D. Zhang, L. Deng, and M. Elmasry. "Pipelined neural network architecture for speech recognition," Proceedings of the 1993 World Congress on Neural Networks, July 11-15, 1993, Portland, Oregon, Vol. III, pp. 55-58.
      • L. Deng. "Design of a feature-based speech recognizer aiming at integration of auditory processing, signal modeling, and phonological structure of speech." (invited) Journal of the Acoustical Society of America, Vol. 93, No.4, Pt. 2, pp. 2318, April, 1993.
      • K. Hassanein, L. Deng, and M. Elmasry. "Maximal mutual information training of a neural predictive HMM speech recognition system," Proceedings of the 1992 IEEE Workshop on Neural Networks for Signal Processing, August 31--September 2, 1992, Copenhagen, Denmark, pp. 164-173.
      • K. Erler and L. Deng. "HMM representation of quantized articulatory features for recognition of highly confusible words," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, CA., March, 1992, pp.545-548.
      • L. Deng. "Speech modeling and recognition using a time series model containing trend functions with Markov modulated parameters," Proceedings of the 1991 IEEE Workshop on Automatic Speech Recognition, Arden House, New York, December, 1991, pp. 24-26.
      • L. Deng and K. Erler. "Microstructural speech units and their HMM representation for discrete utterance speech recognition," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Toronto, Ontario, Canada, May, 1991, pp. 193--196. P. Seitz, V. Gupta, M. Lennig, P. Kenny, L. Deng, D. O'Shaughnessy, and P. Mermelstein. "Phonological rule set complexity as a factor in the performance of a very large vocabulary word recognition system," Journal of the Acoustical Society of America, 87(1), May, 1990, S108 (Abstract).
      • L. Deng, V. Gupta, M. Lennig, P. Kenny, and P. Mermelstein. "Acoustic recognition component of an 86,000-word speech recognizer," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, New Mexico, 1990, pp. 741--744.
      • L. Deng, P. Kenny, M. Lennig, V. Gupta and P. Mermelstein. "A locus model of coarticulation in a hidden-Markov-model-based speech recognizer," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Glascow, Scotland, 1989, pp. 97-100.
      • L. Deng, P. Kenny, M. Lennig, V. Gupta and P. Mermelstein. "Large vocabulary word recognition based on phonetic representation by hidden Markov models", Proceedings of the Canadian Conference on Electrical and Computer Engineering, Vancouver, Canada, November 1988, pp. 131-134.
      • L. Deng, M. Lennig, and P. Mermelstein. "Modeling acoustic-phonetic detail in a hidden-Markov-model-based large vocabulary speech recognizer," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, New York, New York, Vol. 1, April 1988, pp. 509--512.

      Patents (awarded)

      • Removing noise from feature vectors, U.S. Patent No.: 7,310,599; Granted on December 18, 2007;
      • Method of determining uncertainty associated with acoustic distortion-based noise reduction, U.S. Patent No. 7,289,955; Granted on October 30, 2007
      • Method and apparatus for identifying noise environments from noisy signals, U.S. Patent No. 7,266,494; Granted on September 4, 2007
      • Method of noisy reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech, U.S. Patent No.7,254,536; Granted on August 7, 2007
      • Method of determining uncertainty in noise reduction, US and International Patents; U.S. Patent No.: 7,174,292; Granted on Feb. 6, 2007
      • Method of Noise Estimation Using Incremental Bayes Learning, US. Patent; Patent No.: 7,165,026; Granted on Jan. 16, 2007
      • Method of iterative noise estimation in a recursive framework, U.S. Patent; Patent No. 7,139,703; Granted on Nov. 21, 2006.
      • Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization, United States Patent No. 7,117,148; Granted on October 3, 2006.
      • Method of noise reduction based on dynamic aspects of speech, United States Patent No. 7,107,210; Granted on Sept 12, 2006.
      • Method of pattern recognition using noise reduction uncertainty, United States Patent No. 7,103,540; Granted on Sept 5, 2006.
      • Microphone array signal enhancement using mixture models (jointly with Hagai Attias), United States Patent No. 7,103,541; Granted on Sept 5, 2006.
      • Efficient backward recursion for computing posterior probabilities, United States Patent No. 7,062,407; Granted on June 13, 2006.
      • Method of speech recognition using time-dependent interpolation and hidden dynamics, United States (and International) Patent No. 7,050,975; Granted on May 23, 2006.
      • Nonlinear observation models for removing noise from corrupted speech, United States (and International) Patent No. 7,047,047; Granted on May 16, 2006.
      • Method of Noise Reduction Using Correction and Scaling Vectors with Partitioning of the Acoustic Space in the Domain of Noisy Speech, United States Patent No. 7,003,455; Granted on February 21, 2006
      • Methods and Apparatus for Denoising and Dereverberation Using Variational Inference and Strong Speech Models, United States Patent No. 6,990,447; Granted on January 24, 2006
      • Method and Apparatus for Removing Noise from Feature Vectors, United States Patent No. 6,985,858; Granted on January 10, 2006
      • Methods for Including the Category of Environmental Noise When Processing Speech Signals, United States Patent No. 6,959,276; Granted on October 25, 2005
      • Method of iterative noise estimation in a recursive framework, United States Patent; Patent No. 6,944,590; Granted on September 13, 2005
      • Method of speech recognition using variational inference with switching state space models, United States Patent; Patent No. 6,931,374; Granted on August 16, 2005
      • Pattern Recognition Training Method and Apparatus Using Inserted Noise Followed by Noise Reduction, United States (and International) Patent; Patent No. 6,876,966; Granted on April 5, 2005
      • Apparatus for Speaker Clustering and for Speech Recognition, Patent No.: 2,965,537; Granted on Aug. 13, 1999; Countries of issue: United States and Japan.
      • Apparatus for Speaker Normalization Processor and for Voice Recognition Device, Patent No.: 2986792; Granted on Oct. 1, 1999; Countries of issue: United States and Japan.

       

      Downloads

      •  IPAM05-MSR-VTR-Formants (This database was created by the joint work of MSR and UCLA (IPAM). See our ICASSP2006 paper (contained in the download) for details. Note that this is a 20MB download. We suggest that you save it in your disks before installing it. Note also that this is a database, although it appears as a program when you are running and "installing" it.)

       

      E-mail: deng at microsoft dot com
      U.S.Mail: Microsoft Research, One Microsoft Way, Redmond WA, 98052, USA
      Tel: (425) 706-2719
      Fax: (425) 706-7329 (This is the main MS FAX number so make sure to send documents to Li Deng's attention)

       

       

      Professional Activities

      • Editor-In-Chief, IEEE Signal Processing Magazine (term 2009-2012)
      • Board of Governors, IEEE Signal Processing Society (Member at large, elected September 2007; term 2008-2010)
      • Board of Governors, Asian-Pacific Signal and Information Processing Association (APSIPA) (Member, elected September 2009)
      • Publications Board, IEEE Signal Processing Society (Member, 2009-2011)
      • Area Editor, IEEE Signal Processing Magazine (2006-2008)
      • General Chair, IEEE Workshop on Multimedia Signal Processing, Victoria, BC, Canada (2006)
      • Co-General Chair, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Vancouver, BC, Canada (2013)
      • Co-Chair, NIPS Workshop: Speech and Language --- Learning-Based Methods and Systems, Whistler, BC, Canada, 2008
      • Co-Chair, NIPS Workshop: Deep Learning for Speech Recognition and Related Applications, Whistler, BC, Canada, 2009
      • Guest Editor, IEEE Journal of Selected Topics in Signal Processing, Special Issue on Statistical Learning Methods for Speech and Language Processing, 2009
      • IEEE Signal Processing Society TC Review Committee (Member, term 2008-2009)
      • IEEE Signal Processing Society Long Range Planning & Implementation Committee (Member, term 2009-2010)
      • Member, Multimedia Signal Processing Technical Committee of the IEEE Signal Processing Society (2004-2008)
      • Member, Editorial Board, IEEE Signal Processing Letters (2007-2008)
      • Member, Editorial Board, IEEE Signal Processing Magazine (2005-2007)
      • Member, Editorial Board, J. Audio, Music, and Speech Processing (2005-present)
      • Founding Member, Education Committee, IEEE Signal Processing Society (1997-2000)
      • Member, Speech Processing Technical Committee, IEEE Signal Processing Society (1996-1999)
      • Associate Editor, IEEE Transactions on Speech and Audio Processing (2002-2005)
      • Principal Investigator, DARPA (US DoD) EARS Program, (2002-2005)
      • Technical Chair, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2004), Montreal, Quebec, Canada.
      • Co-Guest Editor, IEEE Signal Processing Magazine, Special Issue on Speech Technology and Systems in Human-Machine Communication (Sept 2005)
      • Co-Guest Editor, IEEE Trans. on Computers, Special Issue on Emergent Systems, Algorithms and Architectures for Speech-based Human-Machine Interaction (2006)
      • Member, IEEE Signal Processing Society Technical Directions Committee (2003-2005)
      • Member, IEEE International Conference on Multimedia and Expo Steering Committee (2004-2006)
      • Keynote speaker, IEEE 5th Workshop on Multimedia Signal Processing (IEEE Signal Processing Society), St. Thomas, US Virgin Islands (December 2002)
      • Organizer and speaker, AAAS (American Association for Advancement of Science) Symposium on "Scientific Problems Facing Speech Recognition Today", 2004
      • Gold Star Award, Microsoft Corp
      • Invited Lecturer, NATO Advanced Study Institute
      • Invited Lecturer, European Speech Communication (ESCA) Tutorial and Research Workshops
      • Fellow, The Acoustical Society of America (The American Institute of Physics) (elected Dec. 2003)
      • Fellow, The IEEE (elected Dec. 2004)