*
Quick Links|Home|Worldwide
Microsoft*

Search for



Li Li Deng
Principal Researcher
Speech Technology Group






Education

  • B.S.: Biophysics, University of Science and Technology of China (USTC).
  • Master: Electrical Engineering, University of Wisconsin-Madison, U.S.A.
  • Ph.D.: Electrical Engineering, University of Wisconsin-Madison, U.S.A.

Brief Biography

Li Deng received the Bachelor degree from the University of Science and Technology of China (with the Guo Mo-Ruo Award), and received the Ph.D. degree from the University of Wisconsin-Madison (with the Jerzy E. Rose Award). In 1989, he joined Dept. Electrical and Computer Engineering, University of Waterloo, Ontario, Canada as an Assistant Professor, where he became a Full Professor in 1996. From 1992 to 1993, he conducted sabbatical research at Laboratory for Computer Science, Massachusetts Institute of Technology, Cambridge, Mass, and from 1997-1998, at ATR Interpreting Telecommunications Research Laboratories, Kyoto, Japan. In 1999, he joined Microsoft Research, Redmond, WA as a Senior Researcher, where he is currently a Principal Researcher. He is also an Affiliate Professor in the Department of Electrical Engineering at University of Washington, Seattle. His past and current research activities include automatic speech and speaker recognition, statistical methods and machine learning, neural information processing, machine intelligence, audio and acoustic signal processing, statistical signal processing and digital communication, human speech production and perception, acoustic phonetics, auditory speech processing, auditory physiology and modeling, noise robust speech processing, speech synthesis and enhancement, spoken language understanding systems, multimedia signal processing, and multimodal human-computer interaction. In these areas, he has published over 250 refereed papers in leading international conferences and journals, 12 book chapters, and has given keynotes, tutorials, and lectures worldwide. He has been granted over a dozen US or international patents in acoustics, speech/language technology, and signal processing. He authored two books in speech processing. He serves on the Board of Governors of the IEEE Signal Processing Society, and as Editor-in-Chief (elect) for the IEEE Signal Processing Magazine.

He is a Fellow of the Acoustical Society of America, and a Fellow of the IEEE.  


Professional Activities

  • Area Editor, IEEE Signal Processing Magazine (2006-present)
  • General Chair,  IEEE Workshop on Multimedia Signal Processing (2006)
  • Member, Multimedia Signal Processing Technical Committee of the IEEE Signal Processing Society (2004-present)
  • Member, Editorial Board, IEEE Signal Processing Letters (2007-)
  • Member, Editorial Board, IEEE Signal Processing Magazine (2005-2007)
  • Member, Editorial Board, J. Audio, Music, and Speech Processing (2005-present)
  • Founding Member, Education Committee, IEEE Signal Processing Society (1997-2000)
  • Member, Speech Processing Technical Committee, IEEE Signal Processing Society (1996-1999)
  • Associate Editor,  IEEE Transactions on Speech and Audio Processing (2002-2005)
  • Principal Investigator, DARPA (US DoD) EARS Program, (2002-2005)
  • Technical Chair, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)  (2004)
  • Co-Guest Editor, IEEE Signal Processing Magazine, Special Issue on Speech Technology and Systems in Human-Machine Communication (Sept 2005)
  • Co-Guest Editor, IEEE Trans. on Computers, Special Issue on Emergent Systems, Algorithms and Architectures for Speech-based Human-Machine Interaction (2006)
  • Member, IEEE Signal Processing Society Technical Directions Committee (2003-2005)
  • Member, IEEE International Conference on Multimedia and Expo Steering Committee (2004-2006)
  • Keynote speaker, IEEE 5th Workshop on Multimedia Signal Processing (IEEE Signal Processing Society), St. Thomas, US Virgin Islands (December 2002)
  • Organizer and speaker, AAAS (American Association for Advancement of Science) Symposium on "Scientific Problems Facing Speech Recognition Today", 2004
  • Invited Lecturer, NATO Advanced Study Institute
  • Invited Lecturer, European Speech Communication (ESCA) Tutorial and Research Workshops
  • Fellow, The Acoustical Society of America (The American Institute of Physics) (elected Dec. 2003)
  • Fellow, The IEEE (elected Dec. 2004)
  • IEEE Signal Processing Society TC Review Committee (Member, term 2008-2009)
  • Board of Governors of IEEE Signal Processing Society (Member, elected September 2007; term 2008-2010)
  • Editor-In-Chief Elect, IEEE Signal Processing Magazine

 


Publications (Some PDF files may be downloaded from http://research.microsoft.com/research/srg/papers/)

Books

          Table of Contents:  http://www.amazon.com/gp/reader/0824740408/ref=sib_dp_bod_toc/002-8541730-1403255?ie=UTF8&p=S00L#

  • Li Deng: DYNAMIC SPEECH MODELS --- Theory, Algorithms, and Applications, Morgan & Claypool Publishers, May 2006, (http://www.amazon.com/gp/product/1598290649)
  • Xiaodong He and Li Deng: Discriminative Learning for Speech Recognition --- Theory and Practice, Morgan & Claypool Publishers, 2008.

   Publications in Refereed Journals

  • Sibel Yaman, Li Deng, Dong Yu, Yeyi Wang, Alex Acero. "A discriminative technique for spoken utterance classification,'' IEEE Trans. Audio, Speech, and Language Processing, 2008
  • Dong Yu, Li Deng, J. Droppo, Jian Wu, Yifan Gong, and Alex Acero. "Robust speech recognition using cepstral minimum-mean-square-error noise suppressor," IEEE Trans. Audio, Speech, and Language Processing, 2008.
  • Xiaodong He, Li Deng, Wu Chou. "Discriminative Learning in Sequential Pattern Recognition --- A Unifying Review for Optimization-Oriented Speech Recognition", IEEE Signal Processing Magazine, 2008.
  • Xiaodong He and Li Deng. "Discriminative Learning in Speech Recognition," Technical Report of Microsoft Research (MSR-TR-2007-129). pp. 1-47, Oct 2007. (http://research.microsoft.com/research/pubs/view.aspx?type=Technical%20Report&id=1372)
  • Xiaodong He and Li Deng (invited). "A new look at discriminative learning for hidden Markov models," Pattern Recognition Letters, Vol. 28, 2007, pp.1285-1294.
  • L. Deng, H. Attias, L. Lee, and A. Acero. "Adaptive Kalman smoothing for tracking vocal tract resonances using a continuous-valued hidden dynamic model", IEEE Transactions on audio, Speech and Language Processing, Vol. 15, No. 1, January 2007, pp. 13-23.
  • L. Deng. "Editorial: Expanding the Scope pf Signal Processing," IEEE Signal Processing Magazine, Vol. 25, No. 3, May 2008, pp. 2-4.
  • L. Deng. "Editorial: Write feature articles with a lasting impact," IEEE Signal Processing Magazine, Vol. 24, No. 2, March 2007.
  • Rodrigo Guido, Li Deng, and Shoji Makino. "Introduction: Special Section on Emergent Systems, Algorithms, and Architectures for Speech-Based Human-Machine Interaction," IEEE Transactions on Computers, Vol. 56, No. 9, September 2007, pp. 1-3.
  • D. Yu, L. Deng, and A. Acero. "A lattice search technique for long-contextual-span hidden trajectory model of speech," Speech Communication, Vol. 48, 2006, pp. 1214-1226.
  • L. Deng, D. Yu, and A. Acero. "Structured speech modeling," IEEE Transactions on Audio, Speech and Language Processing (Special Issue on Rich Transcription), Vol. 14, No. 5, Sept 2006, pp. 1492-1504.
  • D. Yu, L. Deng, and A. Acero. "Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation," Computer Speech and Language, Vol. 27, 2007, pp. 72-87.
  • R. Togneri and L. Deng. "A state-space model with neural-network prediction for recovering vocal tract resonances in fluent speech from Mel-cepstral coefficients," Speech Communication, Vol. 48, 2006, pp. 971-988.
  • L. Deng, K. Wang, and W. Chou. "Speech Technology and Systems in Human-Machine Communication --- Guest editors' editorial," IEEE Signal Processing Magazine, Vol. 22, No. 5, Sept 2005, pp. 12-14.
  • Y. Wang, L. Deng, and A. Acero. "An introduction to the statistical framework of spoken language understanding," IEEE Signal Processing Magazine, Vol. 22, No. 5, Sept. 2005, pp. 16-31.
  • L. Deng and D. Yu (invited) "A speech-centric perspective for human-computer interface --- A case study," Journal of VLSI Signal Processing Systems (Special Issue on Multimedia Signal Processing),  Vol. 41, 2005, pp. 255-269.
  • L. Deng, D. Yu, and A. Acero. "A bi-directional target-filtering model of speech coarticulation and reduction: Two-stage implementation for phonetic recognition," IEEE Transactions on Speech and Audio Processing, Vol. 14, No. 1, January 2006, pp. 256-265. 
  • L. Deng, J. Wu, J. Droppo, and A. Acero. "Analysis and comparison of two feature extraction/compensation algorithms," IEEE Signal Processing Letters, Vol. 12, No. 6, June, 2005, pp. 477-480.
  • L. Deng, A. Acero, and I. Bazzi. "Tracking vocal tract resonances using a quantized nonlinear function embedded in a temporal constraint," IEEE Transactions on Speech and Audio Processing, Vol. 14, No. 2, March 2006, pp. 425-434.
  • L. Deng and X.D. Huang. "Forum: Author Response to 'For Voice Interfaces, Hold the SALT'," Communications of the ACM, Vol. 47, No. 7, July 2004, pp. 11-13.
  • L. Deng and X.D. Huang. "Challenges in adopting speech recognition," Communications of the ACM, Vol. 47, No. 1, January 2004, pp. 69-75.
  • L. Deng, J. Droppo, and A. Acero. "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE Transactions on Speech and Audio Processing, Vol. 13, No. 3, May 2005, pp. 412-421.
  • L. Deng, J. Droppo, and A. Acero. "Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition," IEEE Transactions on Speech and Audio Processing," Vol.11, No.6, Nov. 2003, pp. 568-580.
  • L. Deng, J. Droppo, and A. Acero. "Estimating cepstrum of speech under the presence of noise using a joint prior of static and dynamic features," IEEE Transactions on Speech and Audio Processing, Vol. 12, No. 3, May 2004, pp. 218-233.
  • L. Deng, J. Droppo, and A. Acero. "Enhancement of log-spectra of speech using a phase-sensitive model of the acoustic environment," IEEE Transactions on Speech and Audio Processing, Vol. 12, No. 3, March 2004, pp. 133-143.
  • R. Togneri and L. Deng. "Joint state and parameter estimation for a target-directed nonlinear dynamic system model," IEEE Transactions on Signal Processing,  Vol. 51, No. 12, December 2003, pp. 3061-3070.
  • Z. Ma and L. Deng. "A mixed-level switching dynamic system for continuous speech recognition," Computer Speech and Language. Vol. 18, 2004, pp. 49-65.
  • L. Deng, Y. Wang, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, D. Jacoby, M. Mahajan, C. Chelba, and X.D.Huang (invited). "Speech and language processing for multimodal human-computer interaction," Journal of VLSI Signal Processing Systems (Special issue on Real-World Speech Processing), Vol. 36, No. 2, February 2004, pp. 161-187.
  • Jack Xin, Y. Y. Qi, and L. Deng. "Time domain computation of a nonlinear, nonlocal cochlear model with applications to multitone interactions in hearing," Communications in Mathematical Sciences, Vol.1, No.2, 2003, pp. 211-227.
  • J. Ma and L. Deng. "Target-directed mixture linear dynamic models for spontaneous speech recognition,," IEEE Transactions on Speech and Audio Processing, Vol. 12, No. 1, 2004, pp. 47-58.
  • J. Ma and L. Deng. "Efficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model for vocal-tract-resonance dynamics," IEEE Transactions on Speech and Audio Processing,  Vol.11, No.6, Nov. 2003, pp. 590-602.
  • L. Deng, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, Y. Wang, D. Jacoby, M. Mahajan, C. Chelba, and X.D.Huang. "Distributed speech processing in MiPad's multimodal user interface" IEEE Transactions on Speech and Audio Processing, Vol. 10, No. 8, November 2002, pp. 605-619.
  • M. Naito,  L. Deng, and Y. Sagisaka. "Speaker clustering for speech recognition using vocal-tract parameters," Speech Communication, Vol. 36, No. 3-4, March 2002, pp. 305-315.
  • H. Sameti and L. Deng. "Nonstationary-state hidden Markov model representation of speech signals for speech enhancement",  Signal Processing, Vol. 82, 2002, pp. 205-227.
  • J. Sun and  L. Deng. "An overlapping-feature based phonological model incorporating linguistic constraints: Applications to speech recognition," Journal of the Acoustical Society of America, Vol. 111, No. 2, pp. 1086-1101, 2002.
  • H. Jiang and L. Deng. "A robust compensation strategy against extraneous acoustic variations in spontaneous speech recognition,"  IEEE Transactions on Speech and Audio Processing, Vol 10, No. 1, January 2002,  pp.9-17.
  • C. Rathinavelu and  L. Deng. "A maximum a posteriori approach to speaker adaptation using the trended hidden Markov model,"  IEEE Transactions on Speech and Audio Processing, Vol.9, No.5, July 2001, pp. 549-557.
  • H. Jiang and L. Deng. "A Bayesian approach to speaker verification," IEEE Transactions on Speech and Audio Processing , Vol. 9, No. 8, November 2001, pp.874-884. 
  • R. Togneri, J. Ma, and  L. Deng. "Parameter estimation of a target-directed dynamic system model with switching states," Signal Processing, Vol.81, No.5, 2001, pp. 975-987.
  • L. Deng and Z. Ma. "Spontaneous speech recognition using a statistical coarticulatory model for the hidden vocal-tract-resonance dynamics," J. Acoust. Soc. Am, Vol.108, No. 6, Dec 2000, pp.3036-3048.
  • M. Naito,  L. Deng, and Y. Sagisaka. "Speaker normalization for speech recognition using model-based vocal-tract parameters," Transactions of Japan Institute of Electronics, Information, and Communication Engineers (IEICE), Vol.J83-D-II No.11, November 2000, pp. 2360-2369.
  • J. Ma and L. Deng. "A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamic model of speech," Computer Speech and Language , Vol. 14, 2000. pp 101-104
  • Jiping Sun and L. Deng. "Use of high-level linguistic constraints for constructing feature-based phonological model in speech recognition," Journal of Intelligent Information Processing Systems , 1999, p. 269-276.
  • L. Deng (invited). "Locus equation and hidden parameters of speech," Journal of Behavioral and Brain Sciences, Vol. 21, Issue 2. April 1998. pp. 263-264. 
  • X. Shen and L. Deng. "A dynamic system approach to speech enhancement using H-infinity filtering algorithm," IEEE Transactions on Speech and Audio Processing , Vol. 7, 1998, p. 391-399.
  • H. Sheikhzadeh and L. Deng. "A layered neural network interfaced with a cochlear model for the study of speech encoding in the auditory system," Computer Speech and Language, Vol. 13, 1999, p. 39-64.
  • H. Sameti, H. Sheikhzadeh, L. Deng and R. Brennan. "HMM-based strategies for enhancement of speech embedded in nonstationary noise," IEEE Transactions on Speech and Audio Processing, Vol.6, No.5, September 1998, p. 445-455.
  • L. Deng. "A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition," Speech Communication. Vol. 24, No. 4, pp. 299-323, 1998.
  • C. Rathinavalu and L. Deng. "Speech trajectory discrimination using the minimum classification error learning," IEEE Transactions on Speech and Audio Processing, Vol.6, No.6, Nov. 1998, p. 505-515.
  • L. Deng, G. Ramsay, and D. Sun. (invited) "Production models as a structural basis for automatic speech recognition," Speech Communication (special issue on speech production modeling), Vol. 22, No. 2, August 1997, pp. 93-112.
  • L. Deng. "Autosegmental representation of phonological units of speech and its phonetic interface," Speech Communication, Vol. 23, No. 3, 1997, pp. 211-222.
  • X. Shen and L. Deng. "Game theory approach to H_inf filter design," IEEE Transactions on Signal Processing, Vol. 45, No. 4, April 1997, pp. 1092-1095
  • C. Rathinavalu and L. Deng. "HMM-based speech recognition using state-dependent, discriminatively derived transforms on Mel-warped DFT features", IEEE Transactions on Speech and Audio Processing, May, 1997, pp. 243-256.
  • L. Deng and X. Shen. "Maximum likelihood in statistical estimation of dynamical systems: Decomposition algorithm and simulation results," Signal Processing, Vol.57, No. 1, 1997, pp. 65-79.
  • L. Deng and C. Rathinavalu. "Construction of state-dependent dynamic parameters by maximum likelihood: Applications to speech recognition," Signal Processing, Vol. 55, No.2, 1997, pp. 149-165.
  • H. Sameti, H. Sheikhzadeh, L. Deng and R. Brennan. "HMM-based strategies for enhancement of speech embedded in nonstationary noise," IEEE Transactions on Speech and Audio Processing, September 1998. 
  • C. Rathinavalu and L. Deng. "Use of generalized dynamic feature parameters for speech recognition," IEEE Transactions on Speech and Audio Processing, May 1997, pp. 232-242. 
  • H. Sheikhzadeh and L. Deng. "Speech analysis and recognition using interval statistics generated from a composite audit ory model," IEEE Transactions on Speech and Audio Processing, Vol. 6, No. 1, January 1998, pp. 50-54. 
  • L. Deng and M. Aksmanovic. "Speaker-independent phonetic classification using hidden Markov models with state-conditioned mixtures of trend functions," IEEE Transactions on Speech and Audio Processing, Vol. 5, No. 4, July 1997, pp. 319-324.
  • X. Shen, and L. Deng. "Decomposition solution of H-infinity filter gain in singularly perturbed systems," Signal Processing, Vol.55, No. 3, 1996, pp. 313-320.
  • L. Deng and H. Sameti. "Transitional speech units and their representation by the regressive Markov states: Applications to speech recognition," IEEE Transactions on Speech and Audio Processing, Vol.4, No.4, July 1996, pp. 301--306. 
  • L. Deng. "Transiems as dynamically-defined, sub-phonemic units of speech: A computational model," Signal Processing, Vol. 49, No. 1, 1996, pp. 25-35.
  • G. Ramsay and L. Deng. "Tracking non-stationary targets using a dynamical system with Markov-modulated parameters, " IEEE Signal Processing Letters, Vol. 2, No. 9, September, 1995, pp. 172-175.
  • L. Deng and C. Rathinavalu. "A Markov model containing state-conditioned second-order nonstationarity: Application to speech recognition," Computer Speech and Language, Vol. 9, No. 1, January, 1995, pp. 63-86. 
  • L. Deng and D. Braam. "Context-dependent Markov model structured by locus equations: Application to phonetic classification," Journal of the Acoustical Society of America, Vol. 96, No. 4, October, 1994, pp. 2008-2025. 
  • L. Deng, M. Aksmanovic, D. Sun, and C. F. J. Wu. "Speech recognition using hidden Markov models with polynomial regression functions as nonstationary states," IEEE Transactions on Speech and Audio Processing, Vol. 2, No. 4, October, 1994, pp. 507-520. 
  • D. Sun, L. Deng, C. F. J. Wu. "State-dependent time warping in the trended hidden Markov model," Signal Processing, Vol. 39, No. 1, 1994, pp. 263-275. 
  • L. Deng. "Integrated optimization of dynamic feature parameters for hidden Markov modeling of speech," IEEE Signal Processing Letters, Vol. 1, No. 4, April, 1994, pp. 66-69. 
  • L. Deng and D. Sun. "A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features," Journal of the Acoustical Society of America, Vol. 95, No. 5, May 1994, pp. 2702-2719. 
  • L. Deng, K. Hassanein, and M. Elmasry. "Analysis of correlation structure for a neural predictive model with application to speech recognition," Neural Networks, Vol. 7, No. 2, 1994, pp. 331-339. 
  • D. Zhang, L. Deng, and M. Elmasry. "Pipelined architectures for neural-network-based speech recognition," Neural, Parallel & Scientific Computations, Vol. 2, No. 1, March, 1994, pp. 81-- 92.
  • L. Deng. "A statistical model for formant-transition microsegments of speech incorporating locus equations," Signal Processing, Vol. 37, No. 1, 1994, pp. 121--128.
  • H. Sheikhzadeh and L. Deng. "Waveform-based speech recognition using hidden filter models: Parameter selection and sensitivity to power normalization," IEEE Transactions on Speech and Audio Processing, Vol. 2, No. 1, January, 1994, pp. 80--91.
  • L. Deng and I. Kheirallah. "Numerical property and efficient solution of a nonlinear transmission-line model for basilar-membrane wave motions," Signal Processing, Vol. 33, No. 3, 1993, pp. 269--286.
  • L. Deng. "A stochastic model of speech incorporating hierarchical nonstationarity," IEEE Transactions on Speech and Audio Processing, Vol. 1, No. 4, October 1993, pp. 471--475. 
  • L. Deng and Jon W. Mark. "Parameter estimation of Markov modulated Poisson processes as a telecommunication traffic model via the EM algorithm with time discretization," Telecommunication Systems, Vol. 1, No. 3, 1993, pp. 321-338.
  • K. Erler and L. Deng. "Hidden Markov model representation of quantized articulatory features for speech recognition," Computer Speech and Language, Vol. 7, No. 3, 1993, pp. 265-282.
  • L. Deng and I. Kheirallah. "Dynamic formant tracking of noisy speech using temporal analysis on outputs from a nonlinear cochlear model," IEEE Transactions on Biomedical Engineering, Vol. 40, No. 5, 1993, pp. 456--467. 
  • L. Deng and K. Erler. "Structural design of a hidden Markov model based speech recognizer using multi-valued phonetic features: Comparison with segmental speech units," Journal of the Acoustical Society of America, Vol.92, No.6, December, 1992, pp.3058-3067. 
  • L. Deng. "A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal," Signal Processing, Vol.27, No.1, April 1992, pp. 65-78. 
  • L. Deng, P. Kenny, M. Lennig, and P. Mermelstein. "Modeling acoustic transitions in speech by state-interpolation hidden Markov models," IEEE Transactions on Signal Processing, Vol.40, No.2, February, 1992, pp. 265-272.
  • L. Deng. "Processing of acoustic signals in a cochlear model incorporating laterally coupled suppressive elements," Neural Networks, Vol.5, No.1, January 1992, pp.19-34.
  • L. Deng. "Hierarchical non-stationarity in a class of doubly stochastic time series models with application to speech recognition," (invited paper). Canadian Acoustics, Vol. 19, No. 4, September, 1991, pp. 113--115.
  • L. Deng, P. Kenny, M. Lennig, V. Gupta, F. Seitz and P. Mermelstein. "Phonemic hidden Markov models with continuous mixture output densities for large vocabulary word recognition," IEEE Transactions on Signal Processing, Vol. 39, No. 7, July, 1991, pp. 1677--1681.
  • L. Deng. "Non-parametric estimation of phase variance in auditory-nerve fiber' s responses to tonal stimuli," Journal of the Acoustical Society of America, Vol.90, No.6, December 1991, pp. 3099--3106.
  • L. Deng. "The semi-relaxed algorithm for parameter estimation of hidden Markov models," Computer Speech and Language, Vol. 5, No.3, August, 1991, pp. 231--236.
  • L. Deng, M. Lennig, F. Seitz and P. Mermelstein. "Large vocabulary word recognition using context-dependent allophonic hidden Markov models," Computer Speech and Language, Vol.4, No.4, December, 1990, pp. 345-357.
  • P. Seitz, V. Gupta, M. Lennig, P. Kenny, L. Deng, D. O'Shaughnessy, and P. Mermelstein. "A dictionary for a very large vocabulary word recognition system," Computer Speech and Language, Vol. 4, No.2, 1990, pp. 193-202.
  • L. Deng, M. Lennig, and P. Mermelstein. "Modeling microsegments of stop consonants in a hidden Markov model based word recognizer," Journal of the Acoustical Society of America, Vol. 87, June, 1990, pp. 2738-2747.
  • L. Deng, M. Lennig, and P. Mermelstein. "Use of vowel duration information in a large vocabulary word recognizer," Journal of the Acoustical Society of America, Vol. 86, August, 1989, pp. 540-548.
  • L. Deng, C.D. Geisler, and S. Greenberg. "A composite model of the auditory periphery for the processing of speech," (invited paper). Journal of Phonetics, special theme issue on Representation of Speech in the Auditory Periphery, Vol. 16, No. 1, January, 1988, pp. 93-108.
  • L. Deng and C.D. Geisler. "Responses of auditory-nerve fibers to nasal consonant-vowel syllables," Journal of the Acoustical Society of America, Vol. 82, No. 6, December 1987, pp. 1977--1988.
  • L. Deng, C.D. Geisler, and S. Greenberg. "Responses of auditory-nerve fibers to multiple-tone complexes," Journal of the Acoustical Society of America, Vol. 82, No. 6, December 1987, pp. 1989--2000.
  • L. Deng and C.D. Geisler. "A composite auditory model for processing speech sounds," Journal of the Acoustical Society of America, Vol. 82, No. 6, December 1987, pp. 2001--2012.
  • S.R. Greenberg, C.D. Geisler, and L. Deng. "Frequency selectivity of single cochlear-nerve fibers based on the temporal response pattern of two-tone signals," Journal of the Acoustical Society of America, Vol. 79, No. 4, April 1986, pp. 10 10--1019.
  • L. Deng and C.D. Geisler. "Changes in the phase of excitor-tone responses in auditory-nerve fibers by suppressor tones," Journal of the Acoustical Society of America, Vol. 78, No. 11, November 1985, p p. 1633--1644.
  • C.D. Geisler and L. Deng. "Thresholds for primary auditory fibers using statistically defined criteria," Journal of the Acoustical Society of America, Vol. 77, No. 3, March 1985, pp. 1102--1109.

 

Recent Refereed Conference Publications:
  • Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan Gong, Alex Acero. "ADAPTATION OF COMPRESSED HMM PARAMETERS FOR RESOURCE-CONSTRAINED SPEECH RECOGNITION," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, March 31-April 5, 2008, Las Vegas.
  • Tsung-Hui Chang, Zhi-Quan Luo, Li Deng, Chong-Yung Chi. "A Convex Optimization Method for Joint Mean and Variance Parameter Estimation of Large-Margin CDHMM,"Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, March 31-April 5, 2008, Las Vegas, pp.
  • Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, Alex Acero. "MINIMUM-MEAN-SQUARE-ERROR NOISE REDUCTION ALGORITHM ON MEL-FREQUENCY CEPSTRA FOR ROBUST SPEECH RECOGNITION," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, March 31-April 5, 2008, Las Vegas, pp.
  • Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero. "HMM ADAPTATION USING A PHASE-SENSITIVE ACOUSTIC DISTORTION MODEL FOR ENVIRONMENT-ROBUST SPEECH RECOGNITION," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, March 31-April 5, 2008, Las Vegas, pp.
  • Li Deng (invited). ``Roles of high-fidelity acoustic modeling in robust speech recognition,''
    Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, Dec 9-13, 2007, 12 pages.
  • Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acere. ``HIGH-PERFORMANCE HMM ADAPTATION WITH JOINT COMPENSATION OF ADDITIVE AND CONVOLUTIVE DISTORTIONS VIA VECTOR TAYLOR SERIES'', Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, Dec 9-13, 2007, 6 pages.
  • Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero. "HIGH-PERFORMANCE HMM ADAPTATION WITH JOINT COMPENSATION OF ADDITIVE AND CONVOLUTIVE DISTORTIONS VIA VECTOR TAYLOR SERIES", Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, Japan, Dec 9-13, 2007.
  • D. Yu and L. Deng (invited). "Large-Margin Discriminative Training of Hidden Markov Models for Speech Recognition," Proc. IEEE International Conference on Semantic Computing, Irvine, CA, September 17-19, 2007.
  • R. Togneri and L. Deng. "A Structured Speech Model Parameterized by Recursive Dynamics and Neural Networks," Proceedings of Interspeech, Antweerp, Belgium, Aug. 27-31, 2007. pp. 894-897.
  • D. Yu, L. Deng, and A. Acero. "Handling Phonetic Context and Speaker Variation in a Structure-Based Speech Recognizer," Proceedings of Interspeech, Antweerp, Belgium, Aug. 27-31, 2007, pp. 906-909.
  • L. Deng and H. Strik. "Structure-Based and Template-Based Automatic Speech Recognition --- Comparing parametric and non-parametric approaches," Proceedings of Interspeech, Antweerp, Belgium, Aug. 27-31, 2007, pp. 894-897.
  • Q. Fu, Xiaodong He, and L. Deng. "Phone-Discriminating Minimum Classification Error (P-MCE) Training for Phonetic Recognition," Proceedings of Interspeech, Antweerp, Belgium, Aug. 27-31, 2007, pp. 2073-2076.
  • Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, and Alex Acero, "A DISCRIMINATIVE TRAINING FRAMEWORK USING N-BEST SPEECH RECOGNITION TRANSCRIPTIONS AND SCORES FOR SPOKEN UTTERANCE CLASSIFICATION," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Honolulu, Hawaii, April 2007
  • Li Deng and Dong Yu, "Use of Differential Cepstra as Acoustic Features in Hidden Trajectory Modeling for Phonetic Recognition," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Honolulu, Hawaii, April 2007
  • Dong Yu, Li Deng, Xiaodong He, Alex Acero, "LARGE-MARGIN MINIMUM CLASSIFICATION ERROR TRAINING FOR LARGE-SCALE SPEECH RECOGNITION TASKS," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Honolulu, Hawaii, April 2007
  • Xiaolong Li, Yuncheng Ju, Li Deng, Alex Acero, "EFFICIENT AND ROBUST LANGUAGE MODELING IN AN AUTOMATIC CHILDREN’S READING TUTOR SYSTEM," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Honolulu, Hawaii, April 2007
  • D. Yu, L. Deng, X. He, and A. Acero."Use of incrementally regulated discriminative margins in MCE training for speech recognition,"  Proceedings of Interspeech, Pittsburgh, PA, Sept 2006, pp. 2418-2421
  •  Xiaolong Li, L. Deng, and A. Acero. "Time synchronous decoding for a Long-Contextual-Span Hidden Trajectory Model of Speech,"  Proceedings of Interspeech, Pittsburgh, PA, Sept 2006, pp. 609-612.
  •  Xiaodong He, L. Deng, and W. Chou."A novel learning method for hidden Markov models in speech and audio processing,"  Proc. IEEE Workshop on Multimedia Signal Processing, Victoria, BC, October 2006, 6 pages. CDROM.
  •  Li Deng, Xiaodong Cui, Robert Pruvenok, Jonathan Huang, Safiyy Momen, Yanyi Chen, and Abeer Alwan. "A database of vocal tract resonance trajectories for research in speech processing,"  Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, May 14-19, 2006, Toulouse, France, pp. 60-63.
  • L. Deng, D. Yu, and A. Acero."A generative modeling framework for structured hidden speech dynamics,"  Neural Information Processing System (NIPS) Workshop, Whistler, BC, Canada, Dec. 2005.
  •  L. Deng, D. Yu, and A. Acero."A Long-Contextual-Span Model of Resonance Dynamics for Speech Recognition: Parameter Learning and Recognizer Evaluation,"  IEEE Workshop on ASRU, Nov. 27-Dec 1, 2005, 6 pages (CDROM).
  •  D. Yu, L. Deng, and A. Acero. "A* Lattice Search Algorithm for a Long-Contextual-Span Hidden Trajectory Model and Phonetic Recognizer," Proceedings of Interspeech, Lisbon, Sept 2005, pp. 553-556.
  •  L. Deng, D. Yu, and A. Acero."Learning Statistically Characterized Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation and Reduction," Proceedings of Interspeech, Lisbon, Sept 2005, pp. 1097-1100.
  •  A. Subramanya, L. Deng, Z. Liu, and Z. Zhang. "Multi-sensory speech processing: Incorporating automatically extracted hidden dynamic information,"  Proceedings of the IEEE International Conference on Multimedia & Expo (ICME), July 2005, Amsterdam, 4 pages.
  •  L. Deng, X. Li, D. Yu, and A. Acero."A hidden trajectory model with bidirectional target filtering: Cascaded vs. Integrated implementation for phonetic recognition,"  Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, March 19-23, 2005, Philadelphia, PA, pp 337-340.
  • L. Deng, X. Li, D. Yu, and, A. Acero. "Novel Acoustic Modeling with Structured Dynamics for Speech Coarticulation and Reduction," Proc. of DARPA/NIST RT-04 Workshop, Palisades, New York, Nov. 7-10, 2004, 6 pages.
  • D. Yu, M. Hwang, P. Mau, A. Acero, and L. Deng. "Unsupervised learning from users’error correction in speech dictation," Proceedings of the International Conference on Spoken Language Processing, Oct.4-8, 2004, Jeju Island, Korea, No. Spec4201o.1, pp. 4201-4204.
  • L. Deng, D. Yu, and A. Acero. "A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech," Proceedings of the International Conference on Spoken Language Processing, Oct.4-8, 2004, Jeju Island, Korea, No. WeA501p.20, pp.\ 501-504.
  • R. Togneri and L. Deng. "Use of neural network mapping and extended Kalman filter to recover vocal tract resonances from the MFCC parameters of speech," Proceedings of the International Conference on Spoken Language Processing, Oct.4-8, 2004, Jeju Island, Korea, No.WeB1201o.4, pp. 1201-1204.
  • L. Deng, Z. Liu, Z. Zhang, and A. Acero. "Information fusion for multi-sensor processing --- Extracting and exploiting hidden dynamics of speech captured by a bone-conductive microphone," Proceedings of the IEEE Fifth Workshop on Multimedia Signal Processing, Siena, Italy, Sept 28-Oct 2, 2004, 4 pages.
  • L. Deng, L. Lee, H. Attias, and A. Acero. "A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, Canada, May 2004, Vol. I,  pp.557-560.
  • L. Lee, L. Deng, and H. Attias."A multimodal variational approach to learning and inference in switching state space models," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, Canada, May 2004, Vol. V,  pp.505-508.
  • Z. Zhang, Z. Liu, M. Sinclair, A. Acero, L. Deng, J. Droppo. X. Huang, Y. Zheng. "Multisensory microphones for robust speech detection, enhancement, and recognition," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Montreal, Canada, May 2004, Vol. III,  pp.781-784.
  •  Y. Zheng, Z. Liu, Z. Zhang, M. Sinclair, J. Droppo, L. Deng, A. Acero, and X Huang. "Air- and bone-conductive integrated microphones for robust speech detection and enhancement," Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, Nov. 30--Dec. 4, 2003, St. Thomas, US Virgin Islands. 6 pages in CDROM.
  • J. Wu, J. Droppo, L. Deng, and A. Acero. "A noise-robust ASR frontend using Wiener filters constructed from MMSE estimates of clean speech and noise," Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, Nov. 30--Dec. 4, 2003, St. Thomas, US Virgin Islands. 6 pages in CDROM.
  • L. Deng, I. Bazzi, and A. Acero. "Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint," Proceedings of the European Conference on Speech Communication and Technology, Geneva, Switzerland, September 2003, Vol.I, pp. 73-76.
  • J. Droppo, L. Deng, and A. Acero. "A comparison of three non-linear observation models for noisy speech features," Proceedings of the European Conference on Speech Communication and Technology, Geneva, Switzerland, September 2003,  Vol. II, pp. 681-684.
  • L. Deng, J. Droppo, and A. Acero. "Incremental Bayes learning with prior evolution for tracking nonstationary noise statistics from noisy speech data," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Hong Kong, April 2003, Vol.I, pp. 672-675.
  • F. Seide, J.L. Zhou, and L. Deng. "Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM --- MAP decoding and evaluation," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Hong Kong, April 2003, Vol.I, pp. 748-751.
  • J.L. Zhou, F. Seide, and L. Deng. "Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM --- Models and training," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Hong Kong, April 2003, Vol.I, pp. 744-747.
  • L.J. Lee, H. Attias, and L. Deng. "Variational inference and learning for segmental switching state space models of hidden speech dynamics," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Hong Kong, April 2003, Vol.I, pp. 920-923.
  •  I. Bazzi, A. Acero, and L. Deng. "An expectation-maximization approach for formant tracking using a parameter-free non-linear predictor," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Hong Kong, April 2003, Vol.I, pp. 464-467.
  •  L. Deng, A. Acero, Y. Wang, K. Wang, H. Hon, J. Droppo, M. Mahajan, and XD Huang. "A speech-centric perspective for human-computer interface," (invited). Proceedings of the IEEE Fifth Workshop on Multimedia Signal Processing, Dec. 9-11, 2002, St. Thomas, US Virgin Islands,  5 pages in CDROM.
  • H. Attias and L. Deng. "A new approach to speech enhancement by a microphone array using EM and mixture moels," Proceedings of the International Conference on Spoken Language Processing, Denver CO, September 2002,  pp. 151-154.
  •  J. Droppo, A. Acero, and L. Deng. "Evaluation of SPLICE on the Aurora2 and Aurora3 tasks," Proceedings of the International Conference on Spoken Language Processing, Denver CO, September 2002, pp. 121-124.
  • L. Deng, J. Droppo, and A. Acero. "Log-domain speech feature enhancement using sequential MAP noise estimation and a phase-sensitive model of the acoustic environment," Proceedings of the International Conference on Spoken Language Processing, Denver CO, September 2002, pp. 192-195.
  •  L. Deng, J. Droppo, and A. Acero. "Exploiting variances in robust feature extraction based on a parametric model of speech distortion," Proceedings of the International Conference on Spoken Language Processing, Denver CO, September 2002, pp. 217-220.
  •  J. Droppo, A. Acero, and L. Deng. "A nonlinear observation model for removing noise from corrupted speech log mel-spectral energies," Proceedings of the International Conference on Spoken Language Processing, Denver CO, September 2002, pp. 182-185.
  •  L. Deng, J. Droppo, and A. Acero. "A Bayesian approach to speech feature enhancement using the dynamic cepstral prior," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.I, Orlando, Florida, May 2002, pp. 829-832.
  •  J. Droppo, A. Acero, and L. Deng. "Uncertainty decoding with SPLICE for noise robust speech recognition," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.I, Orlando, Florida, May 2002, pp. 57-60.
  •  J. Ma and L. Deng. "A mixture linear model with target-directed dynamics for spontaneous speech recognition," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.I, Orlando, Florida, May 2002, pp. 961-964.
  •  L. Deng, J. Droppo, and A. Acero. "Recursive estimation of nonstationary noise using a nonlinear model with iterative stochastic approximation," Proceedings of Automtic Speech Recognition and Understanding Workshop, Madonna di Campiglio, Trento, Italy, Dec. 9-13, 2001. 4 pages (CDROM).
  •  T. Kristjansson, B. Frey, L. Deng, and A. Acero. "Joint estimation of noise and channel distortion in a generalized EM framework," Proceedings of Automtic Speech Recognition and Understanding Workshop, Madonna di Campiglio, Trento, Italy, Dec. 9-13, 2001. 4 pages (CDROM).
  •  B. Frey, T. Kristjansson, L. Deng, and A. Acero. "Learning dynamic noise models from noisy speech for robust speech recognition," Advances in Neural Information Processing Systems (NIPS), Vol. 14, Vancouver, Canada, 2001, pp. 101-108.
  •  J. Droppo, L. Deng, A. Acero. "Evaluation of the SPLICE algorithm on the Aurora2 database," Proceedings of the European Conference on Speech Communication and Technology, Vol. 1, Aalborg, Denmark, September 2001, pp. 217-220.
  •  H. Attias, L. Deng, A. Acero, and J. Platt. "A new method for speech denoising and robust speech recognition using probabilistic models for clean speech and for noise," Proceedings of the European Conference on Speech Communication
    and Technology, Vol. 2, Aalborg, Denmark, September 2001, pp. 1903-1906.
  •  B. Frey, L. Deng, A. Acero, and T. Kristjansson. "ALGONQUIN: Iterating Laplace's method to remove multiple types of acoustic distortion for robust speech recognition," Proceedings of the European Conference on Speech Communication and Technology, Aalborg, Denmark, September 2001, pp. 901-904.
  •  J. Ma and L. Deng. "Efficient decoding strategy for conversational speech recognition using state-space models for vocal-tract-resonance dynamics", Proceedings of the European Conference on Speech Communication and Technology, Aalborg, Denmark, September 2001, pp. 603-606.
  •  L. Deng, A. Acero, L. Jiang, J. Droppo, and XD Huang. "High-performance robust speech recognition using stereo training data," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.I, Salt Lake City, Utah, April 2001, pp. 301-304.
  • T. Kristjansson, L. Deng, A. Acero and B. Frey. "Towards non-stationary model-based noise adaptation for large vocabulary speech recognition," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.I, Salt Lake City, Utah, April 2001, pp. 337-340.
  • X. Huang, A. Acero, C. Chelba, L. Deng, J. Droppo, H. Hon, et al. (invited) "MIPAD: A next generation PDA prototype," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.I, Salt Lake City, Utah, April 2001, pp. 9-12. 
  • R. Togneri and L. Deng. "An EKF-based algorithm for learning statistical hidden dynamic model parameters for phonetic recognition," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.I, Salt Lake City, Utah, April 2001, pp. 465-468.
  • L. Lee, P. Fieguth, and L. Deng. "A functional articulatory dynamic model for speech production,"  Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.II, Salt Lake City, Utah, April 2001, pp. 797-800.
  • J. Droppo, L. Deng, A. Acero. "Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system,  Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing,  Vol.I, Salt Lake City, Utah, April 2001, pp. 209-212.
  •  H. Attias, J. Platt, A. Acero, and L. Deng. "Speech denoising and dereveberation using probabilistic models,"  Advances in Neural Information Processing Systems (NIPS), Vol. 13, Denver, CO, Nov. 27-Dec 2, 2000, pp. 758-764.
  • J. Sun and  L. Deng. "Annotation and use of speech production corpus for building language-universal speech recognizers", Proceedings of the 2nd International Symposium on Chinese Spoken Language Processing (ISCSLP), Beijing, October 2000, Vol. 3, pp. 31-34.
  • J. Sun, L. Deng, and X. Jing."Data-driven model construction for continuous speech recognition using overlapping articulatory features,"  Proceedings of the International Conference on Spoken Language Processing, October 2000, Vol. 1, pp. 437-440.
  •  A. Acero, L. Deng, T. Kristjansson, and J. Zhang. "HMM adaptation using vector Taylor series for noisy speech recognition,"   Proceedings of the International Conference on Spoken Language Processing,October 2000, Vol. 3, pp. 869-872.
  • X. Huang, A. Acero, C. Chelba, L. Deng, D. Duchene, J. Goodman, H. Hon, D. Jacoby, L. Jiang, R. Loynd, M. Mahajan, P. Mau, S. Meredith, S. Mughal, S. Neto, M. Plumpe, K. Wang, Y. Wang. "MIPAD: A next generation PDA prototype,"  Proceedings of the International Conference on Spoken Language Processing, October 2000, Vol. 3, pp. 33-36.
  • L. Deng, A. Acero, M. Plumpe, and X.D. Huang."Large-vocabulary speech recognition under adverse acoustic environments,"  Proceedings of the International Conference on Spoken Language Processing, October 2000, Vol. 3, pp. 806-809.
  • H. Jiang and  L. Deng. "A robust training strategy against extraneous acoustic variations for spontaneous speech recognition,"  Proceedings of the International Conference on Spoken Language Processing, October 2000, Vol. 4, pp. 161-164.
  • J. Sun, R. Tongneri and  L. Deng. "A robust speech understanding system using conceptual relational grammar,"  Proceedings of the International Conference on Spoken Language Processing,October 2000, Vol. 2, pp. 879-882.
  • S. Dusan and L. Deng. "Acoustic-to-articulatory inversion using dynamical and phonological constraints"  Proceedings of the 5th Speech Production Workshop: MODELS AND DATA, Kloster Seeon, Germany, May 1-4, 2000, pp. 237-240.
Book Chapters
  • Li Deng and Jianwu Dang. "Speech Analysis: The Production-Perception Perspective," Chapter 1 in Hai-Zhou Li and Chin-Hui Lee (eds.), Advances in Chinese Spoken Language Processing,  Publisher: World Scientific, New Jersey, 2007, pp. 3-32.
  • Dong Yu and Li Deng. "Speech-Centric Multimodal User Interface Design in Mobile Technology",  Chapter XVIII in Jo Lumsden (Ed.), Handbook of Research on User Interface Design and Evaluation for Mobile Technology,  Publisher: IGI Global (Information Science Reference), New York, 2008.
  • L. Deng and H. Sheikhzadeh.  "Use of an Integrated Neural-Network and Cochlear Model for the Study of Speech Encoding in the Auditory System," in W. Ainsworth and S. Greenberg (eds.)  Listening to Speech: An Auditory Perspective, Publisher: Lawrence Erlbaum Associates, 2006, pp. 237-256.
  • L. Deng. "Switching Dynamic System Models for Speech Articulation and Acoustics," in M. Johnson, M. Ostendorf, S. Khudanpur, and R. Rosenfeld (eds.)  IMA Volume 138:  Mathematical Foundations of Speech and Language Processing, Springer-Verlag, New York, 2003, pp. 115--134.
  •  C. Avendano, L. Deng, H. Hermansky, and B. Gold. "The Analysis and Representation of Speech," Chapter 2 in S. Greenberg, W. Ainsworth, A. Popper, and R. Fay (eds.)  Speech Processing in the Auditory System, Springer, New York, 2005.
  • L. Deng. "Articulatory Features and Associated Production Models in Statistical Speech Recognition," in K. Ponting (ed.) Computational Models of Speech Pattern Processing, (NATO ASI Series), Springer, 1999, pp. 214-224.
  • L. Deng. "Computational Models for Speech Production," Computational Models of Speech Pattern Processing, (NATO ASI Series), Springer, 1999, pp. 199-213. 
  • L. Deng. "Computational Models for Auditory Speech Processing," Computational Models of Speech Pattern Processing,  (NATO ASI Series), Springer, 1999, pp. 67-77. 
  • L. Deng. "A dynamic, feature-based approach to speech modeling and recognition," in S. Furui, F. Juang and W. Chou (eds.) Automatic Speech Recognition and Understanding, NJ., IEEE (Catalog No. 97TH8241), 1997, pp. 107-114.
  • D. Sun and L. Deng. "Nonstationary-State Hidden Markov Models for Speech Recognition," in S. E. Levinson and L. Shepp, (eds.) Image and Speech Models --- Volume 80 in IMA Volumes in Mathematics and its Applications, Springer-Verlag, New York, 1995, pp. 161--182.
  • D. Zhang, L. Deng, and M. Elmasry. "Pipelined Neural Network Architecture For Speech Recognition," Chapter 9 in M.I. Elmasry, (ed.) VLSI Artificial Neural Networks Engineering, Kluwer Academic Publishers, 1994, pp. 297-315.
  • K. Hassanein, L. Deng, and M. Elmasry. "Neural Predictive Hidden Markov Model Architecture For Speech And Speaker Recognition," Chapter 10 in M.I. Elmasry, (ed.) VLSI Artificial Neural Networks Engineering, Kluwer Academic Publishers, 1994, pp. 316-336.
  • L. Deng, K. Hassanein, and M. Elmasry. "Neural-Network Architecture For Linear And Nonlinear Predictive Hidden Markov Models: Application To Speech Recognition," in B. H. Juang, S. Y. Kung, and C. A. Kamm, (eds.) Neural Networks for Signal Processing , Princeton, NJ, IEEE (Catalog No. 91TH0385), 1991, pp. 411--421.
  • L. Deng. "Interfacing Displacement Sensors --- Linear Variable Differential Transformers," Chapter 9 in W. Tompkins and J. Webster, (eds.) Interfacing Sensors to the IBM PC, Prentice-Hall Inc., Englewood Cliffs, New Jersey, 1988, pp. 250-301.

Selected Publications (prior to 1999 joining MS)
  • M. Naito, L. Deng, and Y. Sagisaka. "Speaker clustering for speech recognition using the parameters characterizing vocal tract dimensions," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Seattle, WA, May 11-15, 1998.
  • M. Naito, L. Deng, and Y. Sagisaka. "Speaker adaptation methods using vocal tract parameters," (in Japanese) Proceedings of the 1998 Spring Meeting of the Acoustical Society of Japan, Yokohama, Japan, March 17-19, 1998, pp. 55-56.
  • M. Naito, L. Deng, and Y. Sagisaka. "A study on speaker clustering methods using vocal tract parameters," (in Japanese) Proceedings of Japan Institute of Electronics, Information, and Communication Engineers (IEICE), Yokosuka, Japan, December 1997, Vol. 97, No. 441, pp. 35-40.
  • L. Deng (invited). "A dynamic, feature-based approach to speech modeling and recognition," Proceedings of the 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barbara, CA, December 14-17, 1997, pp. 107-114.
  • C. Rathinavelu and L. Deng. "Speech adaptation experiments using nonstationary-state HMMs: A MAP approach," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, 1997, Vol. 2, pp. 1415-1418.
  • L. Deng. "Integrated-multilingual speech recognition using universal phonological features in a functional speech production model," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Munich, Germany, 1997, Vol. 2, pp. 1007-1010.
  • C. Rathinavelu and L. Deng. "On the use of discriminatively derived feature space transformation in speech recognition," Proceedings of the International Conference on Signal Processing Applications and Technology, Boston, MA, October 7-10, 1996, pp. 1769-1773.
  • C. Rathinavelu and L. Deng. "Trended HMM with discriminative training for phonetic classification," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 1049-1052.
  • X. Shen, L. Deng, and A. Yasmin. "H_inf filtering for speech enhancement," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 873-876.
  • L. Deng, X. Shen, and D. Jamieson. "Simulation of disordered speech using a frequency-domain vocal tract model," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 768-771.
  • D. Jamieson, L. Deng, M. Price, V. Parsa, and J. Till. "Interactions of speech disorders with speech coders: Effects on speech intelligibility," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 737-740.
  • G. Ramsay and L. Deng. "Optimal filtering and smoothing for speech recognition using a stochastic target model," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 1113-1116.
  • L. Deng and J. Wu. "Hierarchical partitioning of articulatory state space for articulatory-feature based speech recognition," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 2266-2269.
  • J. Wu and L. Deng. "Acoustic Modeling for Continuous Mandarin-Chinese Speech Recognition," Proceedings of the International Conference on Spoken Language Processing, Philadelphia, PA, October 3-6, 1996, pp. 2281-2284.
  • L. Deng, G. Ramsay, and D. Sun. (invited). "Production models as a structural basis for automatic speech recognition," Proceedings of the Fourth European Speech Production Workshop, Autrans, France, May 24-27, 1996, pp. 69--80.
  • L. Deng. "Finite-state automata derived from overlapping articulatory features: A novel phonological construct for speech recognition," Proceedings of the Workshop on Computational Phonology in Speech Technology, (published by Association for Computational Linguistics), Santa Cruz, CA, June 28, 1996. pp. 37-45.
  • L. Deng and H. Sheikhzadeh. "Temporal and rate aspects of speech encoding in the auditory system: Simulation results on TIMIT data using a layered neural network interfaced with a cochlear model," Proceedings of European Speech Communication Association Tutorial and Research Workshop on the Auditory Basis of Speech Recognition, July 15 - 19, 1996, Keele University, United Kingdom, pp. 75-78.
  • C. Rathinavelu and L. Deng. "HMM-based speech recognition using state-dependent, discriminatively derived transforms on Mel-warped DFT features", Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol.1, Atlanta, Georgia, May 7-10, 1996, pp. 9--12.
  • L. Deng, G. Ramsay, and H. Sameti. "From modeling surface phenomena to modeling mechanisms: Towards a faithful model of the speech process aiming at speech recognition," Proceedings of the 1995 IEEE Workshop on Automatic Speech Recognition, December 10-13, 1995, Snowbird, Utah, pp. 183-184.
  • G. Ramsay and L. Deng. "Maximum-likelihood estimation for articulatory speech recognition using a stochastic target model," Proceedings of the 1995 European Conference on Speech Communication and Technology, Spain, September 18-21, 1995, pp. 1401-1404.
  • G. Ramsay and L. Deng. "Modal analysis of acoustic wave propagation in the vocal tract using a finite-difference method," Proceedings of the XII International Congress of Phonetic Sciences, Stockholm, Sweden, August 13-19, 1995, Vol 2, pp. 338-341.
  • G. Ramsay and L. Deng. "Articulatory synthesis using a stochastic target model of speech production," Proceedings of the XII International Congress of Phonetic Sciences, Stockholm, Sweden, August 13-19, 1995, Vol 2, pp. 478-481.
  • L. Deng, J. Wu, and H. Sameti. "Improved speech modeling and recognition using multi-dimensional articulatory states as primitive speech units," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, May 8-12, 1995, pp. 385-388.
  • D. Sun and L. Deng. "Analysis of acoustic-phonetic variations in fluent speech using TIMIT," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, May 8-12, 1995, pp. 201-204.
  • C. Rathinavelu and L. Deng. "Use of generalized dynamic feature parameters for speech recognition: Maximum likelihood and minimum classification error approaches," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, May 8-12, 1995, pp. 373-376.
  • S. Shen and L. Deng. "Discrete H_inf filtering design with application to speech enhance ment," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Detroit, MI, May 8-12, 1995, pp. 1504-1507.
  • H. Sheikhzadeh, R. Brennan, L. Deng, and H. Sameti, "Real-time implementation of HMM-based MMSE algorithm for speech enhancement in hearing aid applications," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 1995 ,
  • D. Sun and L. Deng. "Nonstationary-state hidden Markov model with state-dependent time warping: Application to speech recognition," Proceedings of the 1994 International Conference on Spoken Language Processing, Vol. 1, Yokohama, Japan, September, 18-22, 1994. pp. 243--246,
  • L. Deng and H. Sameti. "Speech recognition using dynamically defined speech units," Proceedings of the 1994 International Conference on Spoken Language Processing, Vol. 4, pp. 2167--2170, Yokohama, Japan, September, 18-22, 1994.
  • H. Sheikhzadeh and L. Deng. "Interval statistics from a cochlear model in response to speech sounds," Journal of the Acoustical Society of America, Vol. 95, No. 6, June 1994 (Abstract), pp. 2842. (The 127th Meeting of the Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
  • L. Deng and I. Kheirallah. "Stability analysis on finite-difference solution of a basilar-membrane vibration model with application to acoustic signal processing," Journal of the Acoustical Society of America, Vol. 95, No. 6, June 1994 (Abstract), pp. 2840. (The 127th Meeting of the Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
  • L. Deng and H. Sameti. "Articulatory phonology and speech recognition: A study on use of dynamically defined speech primitives," Journal of the Acoustical Society of America, Vol. 95, No. 6, June 1994 (Abstract), pp. 2870. (The 127th Meeting of the Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
  • G. Ramsay and L. Deng. "A stochastic framework for articulatory speech recognition," Journal of the Acoustical Society of America, Vol. 95, No. 6, June 1994 (Abstract), pp. 2871. (The 127th Meeting of the Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
  • K. Hassanein, L. Deng and M. Elmasry. "A neural predictive hidden Markov model for speaker recognition," Proceedings of the Workshop on Automatic Speaker Recognition, Identification and Verification, Martigny, Switzerland, April, 1994, pp. 115-118.
  • L. Deng and M. Aksmanovic. "HMMs with mixtures of trended functions for automatic speech recognition," IEEE International Conference on Speech, Image Processing and Neural Networks, April 13-15, 1994, HongKong, pp. 702-705.
  • L. Deng. "A theory on optimal construction of dynamic features for hidden Markov modeling of speech," IEEE International Conference on Speech, Image Processing and Neural Networks, April 13-15, 1994, HongKong, pp. 351-354.
  • L. Deng and D. Sun. "Phonetic classification and recognition using HMM representation of overlapping articulatory features for all classes of English sounds," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Adelaide, Australia, April 19-22, 1994, Vol. 1, pp. 45-48.
  • K. Hassanein, L. Deng and M. Elmasry. "Vowel classification using a neural predictive HMM: A discriminative training approach," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Adelaide, Australia, April 19-22, 1994, Vol 2, pp. 665-668.
  • H. Sameti, H. Sheikhzadeh, L. Deng and R. Brennan. "Comparative performance of spectral subtraction and HMM-based speech enhancement strategies with application to hearing aid design." Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Adelaide, Australia, April 19-22, 1994, Vol. 1, pp. 13-16.
  • L. Deng. "A computational model of phonology-phonetics integration for automatic speech recognition," Proceedings of the 1993 IEEE Workshop on Automatic Speech Recognition, December 12-15, 1993, Snowbird, Utah, pp. 83--84.
  • K. Hassanein, L. Deng and M. Elmasry. "A neural predictive hidden Markov model for speech and speaker recognition," Proceedings of the Fifth International Conference on Microelectronics December 14-16, 1993, Dhahran, Saudi Arabia, pp. 108-111.
  • L. Deng and D. Sun. "Speech recognition using the atomic speech units constructed from overlapping articulatory features," Proceedings of the 1993 European Conference on Speech Communication and Technology, September 21-23, 1993, Berlin, Germany, Vol. III, pp. 1635--1638.
  • D. Zhang, L. Deng, and M. Elmasry. "Pipelined neural network architecture for speech recognition," Proceedings of the 1993 World Congress on Neural Networks, July 11-15, 1993, Portland, Oregon, Vol. III, pp. 55-58.
  • L. Deng. "Design of a feature-based speech recognizer aiming at integration of auditory processing, signal modeling, and phonological structure of speech." (invited) Journal of the Acoustical Society of America, Vol. 93, No.4, Pt. 2, pp. 2318, April, 1993.
  • K. Hassanein, L. Deng, and M. Elmasry. "Maximal mutual information training of a neural predictive HMM speech recognition system," Proceedings of the 1992 IEEE Workshop on Neural Networks for Signal Processing, August 31--September 2, 1992, Copenhagen, Denmark, pp. 164-173.
  • K. Erler and L. Deng. "HMM representation of quantized articulatory features for recognition of highly confusible words," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, CA., March, 1992, pp.545-548.
  • L. Deng. "Speech modeling and recognition using a time series model containing trend functions with Markov modulated parameters," Proceedings of the 1991 IEEE Workshop on Automatic Speech Recognition, Arden House, New York, December, 1991, pp. 24-26.
  • L. Deng and K. Erler. "Microstructural speech units and their HMM representation for discrete utterance speech recognition," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Toronto, Ontario, Canada, May, 1991, pp. 193--196. P. Seitz, V. Gupta, M. Lennig, P. Kenny, L. Deng, D. O'Shaughnessy, and P. Mermelstein. "Phonological rule set complexity as a factor in the performance of a very large vocabulary word recognition system," Journal of the Acoustical Society of America, 87(1), May, 1990, S108 (Abstract).
  • L. Deng, V. Gupta, M. Lennig, P. Kenny, and P. Mermelstein. "Acoustic recognition component of an 86,000-word speech recognizer," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, New Mexico, 1990, pp. 741--744.
  • L. Deng, P. Kenny, M. Lennig, V. Gupta and P. Mermelstein. "A locus model of coarticulation in a hidden-Markov-model-based speech recognizer," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Glascow, Scotland, 1989, pp. 97-100.
  • L. Deng, P. Kenny, M. Lennig, V. Gupta and P. Mermelstein. "Large vocabulary word recognition based on phonetic representation by hidden Markov models", Proceedings of the Canadian Conference on Electrical and Computer Engineering, Vancouver, Canada, November 1988, pp. 131-134.
  • L. Deng, M. Lennig, and P. Mermelstein. "Modeling acoustic-phonetic detail in a hidden-Markov-model-based large vocabulary speech recognizer," Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, New York, New York, Vol. 1, April 1988, pp. 509--512.
  

Patents (awarded)

  • Removing noise from feature vectors, U.S. Patent No.: 7,310,599; Granted on December 18, 2007;
  • Method of determining uncertainty associated with acoustic distortion-based noise reduction, U.S. Patent No. 7,289,955; Granted on October 30, 2007
  • Method and apparatus for identifying noise environments from noisy signals, U.S. Patent No. 7,266,494; Granted on September 4, 2007
  • Method of noisy reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech, U.S. Patent No.7,254,536; Granted on August 7, 2007
  • Method of determining uncertainty in noise reduction, US and International Patents; U.S. Patent No.: 7,174,292; Granted on Feb. 6, 2007
  • Method of Noise Estimation Using Incremental Bayes Learning, US. Patent; Patent No.: 7,165,026; Granted on Jan. 16, 2007
  • Method of iterative noise estimation in a recursive framework, U.S. Patent; Patent No. 7,139,703; Granted on Nov. 21, 2006.
  • Method of noise reduction using correction vectors based on dynamic aspects of speech and noise normalization, United States Patent No. 7,117,148; Granted on October 3, 2006.
  • Method of noise reduction based on dynamic aspects of speech, United States Patent No. 7,107,210; Granted on Sept 12, 2006.   
  • Method of pattern recognition using noise reduction uncertainty, United States Patent No. 7,103,540; Granted on Sept 5, 2006.
  • Microphone array signal enhancement using mixture models (jointly with Hagai Attias),  United States Patent No. 7,103,541; Granted on Sept 5, 2006.
  • Efficient backward recursion for computing posterior probabilities, United States Patent No. 7,062,407; Granted on June 13, 2006.
  • Method of speech recognition using time-dependent interpolation and hidden dynamics, United States  (and International) Patent No. 7,050,975; Granted on May 23, 2006.
  • Nonlinear observation models for removing noise from corrupted speech, United States  (and International) Patent No. 7,047,047; Granted on May 16, 2006.
  • Method of Noise Reduction Using Correction and Scaling Vectors with Partitioning of the Acoustic Space in the Domain of Noisy Speech, United States Patent No. 7,003,455; Granted on February 21, 2006
  • Methods and Apparatus for Denoising and Dereverberation Using Variational Inference and Strong Speech Models, United States Patent No. 6,990,447; Granted on January 24, 2006
  • Method and Apparatus for Removing Noise from Feature Vectors, United States Patent No. 6,985,858; Granted on January 10, 2006
  • Methods for Including the Category of Environmental Noise When Processing Speech Signals, United States Patent No. 6,959,276; Granted on October 25, 2005
  • Method of iterative noise estimation in a recursive framework, United States Patent;  Patent No. 6,944,590; Granted on September 13, 2005
  • Method of speech recognition using variational inference with switching state space models, United States Patent;  Patent No. 6,931,374; Granted on August 16, 2005
  • Pattern Recognition Training Method and Apparatus Using Inserted Noise Followed by Noise Reduction, United States (and International) Patent;  Patent No. 6,876,966; Granted on April 5, 2005
  • Apparatus for Speaker Clustering and for Speech Recognition, Patent No.: 2,965,537; Granted on Aug. 13, 1999; Countries of issue: United States and Japan.
  • Apparatus for Speaker Normalization Processor and for Voice Recognition Device, Patent No.: 2986792; Granted on Oct. 1, 1999; Countries of issue: United States and Japan.
  • 30 patent applications pending

Downloads

  • IPAM05-MSR-VTR-Formants (This database was created by the joint work of MSR and UCLA (IPAM). See our ICASSP2006 paper (contained in the download) for details. Note that this is a 20MB download. We suggest that you save it in your disks before installing it. Note also that this is a database, although it appears as a program when you are running and "installing" it.)

E-mail: deng@NO_SPAM.microsoft.com
U.S.Mail: Microsoft Corporation, One Microsoft Way, Redmond WA, 98052-6399, USA
Tel: (425) 706-2719
Fax: (425) 706-7329 (This is the main MS FAX number so make sure to send documents to Li Deng's attention)


©2008 Microsoft Corporation. All rights reserved. Terms of Use |Trademarks |Privacy Statement