|
|
Li Deng
Principal Researcher
Speech Technology Group
Education
-
B.S.: Biophysics, University of Science and Technology of China
(USTC).
-
Master: Electrical Engineering, University of Wisconsin-Madison,
U.S.A.
-
Ph.D.: Electrical Engineering, University of
Wisconsin-Madison, U.S.A.
Brief Biography
Li Deng received the Bachelor degree from
the University of Science and Technology of China (with the Guo Mo-Ruo Award), and received
the Ph.D. degree from the University of Wisconsin-Madison (with the Jerzy E. Rose Award). In 1989, he joined Dept. Electrical and Computer
Engineering, University of Waterloo, Ontario, Canada as an Assistant
Professor, where he became a Full Professor in 1996. From 1992 to 1993, he
conducted sabbatical research at Laboratory for Computer Science,
Massachusetts Institute of Technology, Cambridge, Mass, and from
1997-1998, at ATR Interpreting Telecommunications Research Laboratories,
Kyoto, Japan. In 1999, he joined Microsoft Research, Redmond, WA as
a Senior Researcher, where he is currently a Principal Researcher. He is
also an Affiliate Professor in the Department of Electrical Engineering at
University of Washington, Seattle. His past and current research
activities include automatic speech and speaker recognition, statistical
methods and machine learning, neural information processing, machine
intelligence, audio and acoustic signal processing, statistical signal
processing and digital communication, human speech production and
perception, acoustic phonetics, auditory speech processing, auditory
physiology and modeling, noise robust speech processing, speech
synthesis and enhancement, spoken language understanding systems,
multimedia signal processing, and multimodal human-computer interaction.
In these areas, he has published over 250 refereed papers in leading
international conferences and journals, 12 book chapters, and has given
keynotes, tutorials, and lectures worldwide. He has been granted over a
dozen US or international patents in acoustics, speech/language
technology, and signal processing. He authored two books in speech
processing. He serves on the
Board of Governors
of the IEEE Signal Processing Society, and as Editor-in-Chief
(elect) for the IEEE Signal Processing Magazine.
He is a Fellow of the Acoustical Society of America, and
a Fellow of the IEEE.
Professional Activities
- Area Editor, IEEE Signal Processing Magazine (2006-present)
- General
Chair, IEEE Workshop on Multimedia Signal Processing (2006)
- Member, Multimedia Signal Processing Technical Committee of the
IEEE Signal Processing Society (2004-present)
- Member, Editorial Board, IEEE Signal Processing Letters (2007-)
- Member, Editorial Board, IEEE Signal Processing Magazine
(2005-2007)
- Member, Editorial Board, J.
Audio, Music, and Speech Processing (2005-present)
- Founding Member, Education Committee, IEEE Signal Processing Society (1997-2000)
- Member, Speech Processing Technical Committee, IEEE Signal
Processing Society (1996-1999)
- Associate Editor, IEEE Transactions on Speech and Audio
Processing (2002-2005)
- Principal Investigator, DARPA
(US DoD) EARS Program, (2002-2005)
- Technical Chair, IEEE International Conference on Acoustics, Speech,
and Signal Processing (ICASSP) (2004)
- Co-Guest Editor,
IEEE Signal Processing Magazine, Special Issue on Speech Technology
and Systems in Human-Machine Communication (Sept 2005)
- Co-Guest Editor, IEEE Trans. on Computers, Special Issue on
Emergent Systems, Algorithms and Architectures for Speech-based
Human-Machine Interaction (2006)
- Member, IEEE Signal Processing Society Technical Directions
Committee (2003-2005)
- Member, IEEE International Conference on Multimedia and Expo
Steering Committee (2004-2006)
- Keynote speaker, IEEE 5th Workshop on Multimedia Signal
Processing (IEEE Signal Processing Society), St. Thomas, US Virgin
Islands (December 2002)
- Organizer and speaker, AAAS
(American Association for Advancement of Science) Symposium on
"Scientific Problems Facing Speech Recognition Today", 2004
- Invited Lecturer, NATO
Advanced Study Institute
- Invited Lecturer, European
Speech Communication (ESCA) Tutorial and Research Workshops
- Fellow, The Acoustical Society of America (The American Institute of
Physics) (elected Dec. 2003)
- Fellow, The IEEE (elected Dec. 2004)
- IEEE Signal Processing Society TC
Review Committee (Member, term 2008-2009)
- Board of Governors
of
IEEE
Signal Processing Society
(Member, elected September 2007; term 2008-2010)
- Editor-In-Chief Elect, IEEE Signal
Processing Magazine
Books
Table of
Contents:
http://www.amazon.com/gp/reader/0824740408/ref=sib_dp_bod_toc/002-8541730-1403255?ie=UTF8&p=S00L#
- Li Deng: DYNAMIC SPEECH MODELS --- Theory,
Algorithms, and Applications,
Morgan & Claypool Publishers, May 2006,
(http://www.amazon.com/gp/product/1598290649)
- Xiaodong He and Li Deng: Discriminative
Learning for Speech Recognition --- Theory and Practice,
Morgan & Claypool Publishers, 2008.
Publications in Refereed Journals
- Sibel Yaman, Li Deng, Dong Yu, Yeyi Wang, Alex Acero.
"A discriminative technique for spoken utterance classification,'' IEEE
Trans. Audio, Speech, and Language Processing, 2008
-
Dong Yu, Li Deng, J. Droppo, Jian Wu, Yifan Gong, and Alex
Acero. "Robust speech recognition using cepstral minimum-mean-square-error
noise suppressor," IEEE Trans. Audio, Speech, and Language Processing, 2008.
-
Xiaodong He, Li Deng, Wu Chou. "Discriminative Learning in
Sequential Pattern Recognition --- A Unifying Review for
Optimization-Oriented Speech Recognition", IEEE Signal Processing Magazine,
2008.
- Xiaodong He and Li Deng. "Discriminative
Learning in Speech Recognition," Technical Report of Microsoft Research
(MSR-TR-2007-129). pp. 1-47, Oct 2007. (http://research.microsoft.com/research/pubs/view.aspx?type=Technical%20Report&id=1372)
- Xiaodong He and Li Deng (invited). "A new look at discriminative
learning for hidden Markov models," Pattern Recognition Letters, Vol. 28,
2007, pp.1285-1294.
- L. Deng, H. Attias, L. Lee,
and A. Acero. "Adaptive Kalman smoothing for tracking vocal tract resonances
using a continuous-valued hidden dynamic model", IEEE Transactions on
audio, Speech and Language Processing, Vol. 15, No. 1, January 2007, pp.
13-23.
- L. Deng. "Editorial: Expanding the Scope pf Signal Processing,"
IEEE Signal Processing Magazine, Vol. 25, No. 3, May 2008, pp. 2-4.
- L. Deng. "Editorial: Write
feature articles with a lasting impact," IEEE Signal Processing Magazine,
Vol. 24, No. 2, March 2007.
- Rodrigo Guido, Li Deng, and Shoji Makino.
"Introduction: Special Section on Emergent Systems, Algorithms, and
Architectures for Speech-Based Human-Machine Interaction," IEEE Transactions
on Computers, Vol. 56, No. 9, September 2007, pp. 1-3.
- D. Yu, L. Deng, and A. Acero.
"A lattice search technique for
long-contextual-span hidden trajectory model of speech," Speech
Communication, Vol. 48, 2006, pp. 1214-1226.
- L. Deng, D. Yu, and A. Acero.
"Structured speech modeling," IEEE Transactions on Audio, Speech and
Language Processing (Special Issue on Rich Transcription), Vol. 14, No. 5,
Sept 2006, pp. 1492-1504.
-
D. Yu, L. Deng, and A. Acero. "Speaker-adaptive learning of resonance
targets in a hidden trajectory model of speech coarticulation," Computer
Speech and Language, Vol. 27, 2007, pp. 72-87.
- R. Togneri and L. Deng. "A
state-space model with neural-network prediction for recovering vocal tract
resonances in fluent speech from Mel-cepstral coefficients," Speech Communication, Vol. 48, 2006, pp. 971-988.
- L. Deng, K. Wang, and W. Chou. "Speech Technology and Systems in Human-Machine Communication --- Guest editors' editorial,"
IEEE Signal Processing Magazine, Vol. 22, No. 5, Sept 2005, pp. 12-14.
- Y. Wang, L. Deng, and A. Acero. "An introduction to the statistical framework of spoken language understanding,"
IEEE Signal Processing Magazine, Vol. 22, No. 5, Sept. 2005, pp. 16-31.
- L. Deng and D. Yu (invited) "A speech-centric perspective for
human-computer interface --- A case study," Journal of VLSI Signal
Processing Systems (Special Issue on Multimedia Signal Processing),
Vol. 41, 2005, pp. 255-269.
- L. Deng, D. Yu, and A. Acero. "A bi-directional target-filtering model
of speech coarticulation and reduction: Two-stage implementation for
phonetic recognition," IEEE Transactions on Speech and Audio Processing,
Vol. 14, No. 1, January 2006, pp. 256-265.
- L. Deng, J. Wu, J. Droppo, and A. Acero. "Analysis and
comparison of two feature extraction/compensation algorithms," IEEE Signal
Processing Letters, Vol. 12, No. 6, June, 2005, pp. 477-480.
- L. Deng, A. Acero, and I. Bazzi. "Tracking vocal tract resonances using a
quantized nonlinear function embedded in a temporal constraint," IEEE
Transactions on Speech and Audio Processing, Vol. 14, No. 2, March 2006, pp.
425-434.
- L. Deng and X.D. Huang. "Forum: Author
Response to 'For Voice Interfaces, Hold the SALT'," Communications of the
ACM, Vol. 47, No. 7, July 2004, pp. 11-13.
- L. Deng and X.D. Huang. "Challenges in adopting speech recognition,"
Communications of the ACM, Vol. 47, No. 1, January 2004, pp. 69-75.
- L. Deng, J. Droppo, and A. Acero. "Dynamic compensation of HMM
variances using the feature enhancement uncertainty computed from a
parametric model of speech distortion," IEEE Transactions on Speech and
Audio Processing, Vol. 13, No. 3, May 2005, pp. 412-421.
- L. Deng, J. Droppo, and A. Acero.
"Recursive estimation of nonstationary noise using iterative stochastic
approximation for robust speech recognition," IEEE Transactions on Speech
and Audio Processing," Vol.11, No.6, Nov. 2003, pp. 568-580.
- L. Deng, J.
Droppo, and A. Acero. "Estimating cepstrum of speech under the presence of
noise using a joint prior of static and dynamic features," IEEE
Transactions on Speech and Audio Processing, Vol. 12, No. 3, May 2004, pp.
218-233.
- L. Deng, J. Droppo, and A. Acero. "Enhancement of log-spectra of
speech using a phase-sensitive model of the acoustic environment," IEEE
Transactions on Speech and Audio Processing, Vol. 12, No. 3, March 2004, pp.
133-143.
- R. Togneri and L. Deng. "Joint state and parameter estimation
for a target-directed nonlinear dynamic system model," IEEE Transactions on
Signal Processing, Vol. 51, No. 12, December 2003, pp. 3061-3070.
-
Z. Ma and L. Deng. "A mixed-level switching dynamic system for continuous
speech recognition," Computer Speech and Language. Vol. 18, 2004, pp.
49-65.
- L. Deng, Y. Wang, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis,
D. Jacoby, M. Mahajan, C. Chelba, and X.D.Huang (invited). "Speech and
language processing for multimodal human-computer interaction," Journal of
VLSI Signal Processing Systems (Special issue on Real-World Speech
Processing), Vol. 36, No. 2, February 2004, pp. 161-187.
- Jack Xin, Y. Y. Qi, and L. Deng. "Time domain computation of a
nonlinear, nonlocal cochlear model with applications to multitone
interactions in hearing," Communications in Mathematical Sciences, Vol.1,
No.2, 2003, pp. 211-227.
-
J. Ma and L. Deng. "Target-directed mixture linear
dynamic models for spontaneous speech recognition,," IEEE Transactions
on Speech and Audio Processing, Vol. 12, No. 1, 2004, pp. 47-58.
- J. Ma and L. Deng.
"Efficient decoding strategies for conversational speech recognition using a
constrained nonlinear state-space model for vocal-tract-resonance dynamics,"
IEEE Transactions on Speech and Audio Processing, Vol.11, No.6, Nov.
2003, pp. 590-602.
- L. Deng, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, Y.
Wang, D. Jacoby, M. Mahajan, C. Chelba, and X.D.Huang. "Distributed speech
processing in MiPad's multimodal user interface" IEEE Transactions on Speech
and Audio Processing, Vol. 10, No. 8, November 2002, pp. 605-619.
- M. Naito, L. Deng, and Y. Sagisaka. "Speaker clustering for speech
recognition using vocal-tract parameters," Speech Communication, Vol. 36,
No. 3-4, March 2002, pp. 305-315.
- H. Sameti and L. Deng.
"Nonstationary-state hidden Markov model representation of speech signals
for speech enhancement", Signal Processing, Vol. 82, 2002, pp.
205-227.
- J. Sun and L. Deng. "An overlapping-feature based phonological model
incorporating linguistic constraints: Applications to speech recognition,"
Journal of the Acoustical Society of America, Vol. 111, No. 2, pp. 1086-1101,
2002.
- H. Jiang and L. Deng. "A robust compensation strategy against extraneous
acoustic variations in spontaneous speech recognition," IEEE
Transactions on Speech and Audio Processing, Vol 10, No. 1, January 2002,
pp.9-17.
- C. Rathinavelu and L. Deng. "A maximum a posteriori
approach to speaker adaptation using the trended hidden Markov
model," IEEE Transactions on
Speech and Audio Processing, Vol.9, No.5, July 2001, pp. 549-557.
- H. Jiang and L. Deng. "A Bayesian approach to speaker verification," IEEE
Transactions on Speech and Audio Processing , Vol. 9, No. 8, November 2001,
pp.874-884.
- R. Togneri, J. Ma, and L. Deng. "Parameter estimation of a
target-directed dynamic system model with switching states," Signal
Processing, Vol.81, No.5, 2001, pp. 975-987.
- L. Deng and Z. Ma. "Spontaneous speech recognition using a statistical
coarticulatory model for the hidden vocal-tract-resonance dynamics," J.
Acoust. Soc. Am, Vol.108, No. 6, Dec 2000, pp.3036-3048.
- M. Naito, L. Deng, and Y. Sagisaka. "Speaker normalization for speech
recognition using model-based vocal-tract parameters," Transactions of Japan
Institute of Electronics, Information, and Communication Engineers (IEICE),
Vol.J83-D-II No.11, November 2000, pp. 2360-2369.
- J. Ma and L. Deng. "A path-stack algorithm for optimizing
dynamic regimes in a statistical hidden dynamic model of
speech," Computer Speech and Language , Vol. 14, 2000. pp 101-104
- Jiping Sun and L. Deng. "Use of high-level linguistic
constraints for constructing feature-based phonological
model in speech recognition," Journal
of Intelligent Information Processing Systems , 1999, p. 269-276.
- L. Deng (invited). "Locus equation and hidden parameters of
speech," Journal of Behavioral and Brain Sciences, Vol. 21, Issue 2.
April 1998. pp. 263-264.
- X. Shen and L. Deng. "A dynamic system approach to speech
enhancement using H-infinity filtering algorithm," IEEE Transactions on Speech and Audio
Processing , Vol. 7, 1998, p. 391-399.
- H. Sheikhzadeh and L. Deng. "A layered neural network
interfaced with a cochlear model for the study of speech
encoding in the auditory system," Computer
Speech and Language, Vol. 13, 1999, p. 39-64.
- H. Sameti, H. Sheikhzadeh, L. Deng and R. Brennan.
"HMM-based strategies for enhancement of speech embedded in
nonstationary noise," IEEE
Transactions on Speech and Audio Processing, Vol.6, No.5, September
1998, p. 445-455.
- L. Deng. "A dynamic, feature-based approach to the interface
between phonology and phonetics for speech modeling and
recognition," Speech Communication.
Vol. 24, No. 4, pp. 299-323, 1998.
- C. Rathinavalu and L. Deng.
"Speech trajectory discrimination using the minimum classification error
learning," IEEE Transactions on Speech
and Audio Processing, Vol.6, No.6, Nov. 1998, p. 505-515.
- L. Deng, G. Ramsay, and D. Sun. (invited) "Production
models as a structural basis for automatic speech
recognition," Speech Communication
(special issue on speech production modeling), Vol. 22, No. 2, August 1997, pp.
93-112.
- L. Deng. "Autosegmental representation of phonological units
of speech and its phonetic interface," Speech Communication, Vol. 23, No. 3, 1997,
pp. 211-222.
- X. Shen and L. Deng. "Game theory approach to H_inf filter
design," IEEE
Transactions on Signal Processing,
Vol. 45, No. 4, April 1997, pp. 1092-1095
- C. Rathinavalu and L. Deng. "HMM-based speech recognition using
state-dependent, discriminatively derived transforms on Mel-warped DFT
features", IEEE Transactions on Speech and Audio Processing, May,
1997, pp. 243-256.
- L. Deng and X. Shen. "Maximum likelihood in statistical
estimation of dynamical systems: Decomposition algorithm and
simulation results," Signal
Processing, Vol.57, No. 1, 1997, pp. 65-79.
- L. Deng and C. Rathinavalu. "Construction of
state-dependent dynamic parameters by maximum likelihood:
Applications to speech recognition," Signal
Processing, Vol. 55, No.2, 1997, pp. 149-165.
- H. Sameti, H. Sheikhzadeh, L. Deng and R. Brennan.
"HMM-based strategies for enhancement of speech embedded in
nonstationary noise," IEEE
Transactions on Speech and Audio Processing, September 1998.
- C. Rathinavalu and L. Deng. "Use of generalized
dynamic feature parameters for speech recognition," IEEE Transactions on Speech and Audio
Processing, May 1997, pp. 232-242.
- H. Sheikhzadeh and L. Deng. "Speech analysis and recognition using
interval statistics generated from a composite audit ory
model," IEEE
Transactions on Speech and Audio Processing, Vol. 6, No. 1, January
1998, pp. 50-54.
- L. Deng and M. Aksmanovic. "Speaker-independent
phonetic classification using hidden Markov models with
state-conditioned mixtures of trend functions," IEEE Transactions on Speech and Audio Processing, Vol. 5,
No. 4, July 1997, pp. 319-324.
- X. Shen, and L. Deng. "Decomposition solution of H-infinity
filter gain in singularly perturbed systems," Signal Processing,
Vol.55, No. 3, 1996, pp. 313-320.
- L. Deng and H. Sameti. "Transitional speech units and
their representation by the regressive Markov states:
Applications to speech recognition," IEEE Transactions on Speech and Audio Processing, Vol.4,
No.4, July 1996, pp. 301--306.
- L. Deng. "Transiems as dynamically-defined, sub-phonemic
units of speech: A computational model," Signal Processing, Vol. 49, No. 1, 1996,
pp. 25-35.
- G. Ramsay and L. Deng. "Tracking non-stationary
targets using a dynamical system with Markov-modulated
parameters, " IEEE Signal Processing
Letters, Vol. 2, No. 9, September, 1995, pp. 172-175.
- L. Deng and C. Rathinavalu. "A Markov model
containing state-conditioned second-order nonstationarity:
Application to speech recognition," Computer Speech and Language, Vol. 9, No. 1, January,
1995, pp. 63-86.
- L. Deng and D. Braam. "Context-dependent Markov model
structured by locus equations: Application to phonetic
classification," Journal of the
Acoustical Society of America, Vol. 96, No. 4, October, 1994, pp.
2008-2025.
- L. Deng, M. Aksmanovic, D. Sun, and C. F. J. Wu.
"Speech recognition using hidden Markov models with
polynomial regression functions as nonstationary states,"
IEEE Transactions on Speech and Audio Processing,
Vol. 2, No. 4, October, 1994, pp. 507-520.
- D. Sun, L. Deng, C. F. J. Wu. "State-dependent time
warping in the trended hidden Markov model," Signal Processing, Vol. 39, No. 1, 1994,
pp. 263-275.
- L. Deng. "Integrated optimization of dynamic feature
parameters for hidden Markov modeling of speech," IEEE Signal Processing Letters, Vol. 1,
No. 4, April, 1994, pp. 66-69.
- L. Deng and D. Sun. "A statistical approach to
automatic speech recognition using the atomic speech units
constructed from overlapping articulatory features,"
Journal of the Acoustical Society of America,
Vol. 95, No. 5, May 1994, pp. 2702-2719.
- L. Deng, K. Hassanein, and M. Elmasry.
"Analysis of correlation structure for a neural predictive
model with application to speech recognition," Neural Networks, Vol. 7, No. 2,
1994, pp. 331-339.
- D. Zhang, L. Deng, and M. Elmasry. "Pipelined
architectures for neural-network-based speech recognition,"
Neural, Parallel
& Scientific Computations, Vol. 2, No. 1, March, 1994, pp. 81--
92.
- L. Deng. "A statistical model for formant-transition microsegments
of speech incorporating locus equations," Signal Processing, Vol. 37, No. 1,
1994, pp. 121--128.
- H. Sheikhzadeh and L. Deng. "Waveform-based speech
recognition using hidden filter models: Parameter selection
and sensitivity to power normalization," IEEE Transactions on Speech and Audio
Processing, Vol. 2, No. 1, January, 1994, pp. 80--91.
- L. Deng and I. Kheirallah. "Numerical property and
efficient solution of a nonlinear transmission-line model
for basilar-membrane wave motions," Signal Processing, Vol. 33, No. 3,
1993, pp. 269--286.
- L. Deng. "A stochastic model of speech incorporating
hierarchical nonstationarity," IEEE Transactions on Speech and Audio Processing,
Vol. 1, No. 4, October 1993, pp. 471--475.
- L. Deng and Jon W. Mark. "Parameter estimation of Markov
modulated Poisson processes as a telecommunication traffic
model via the EM algorithm with time discretization,"
Telecommunication Systems, Vol.
1, No. 3, 1993, pp. 321-338.
- K. Erler and L. Deng. "Hidden Markov model
representation of quantized articulatory features for speech
recognition," Computer Speech and Language, Vol. 7, No. 3, 1993,
pp. 265-282.
- L. Deng and I. Kheirallah. "Dynamic formant tracking
of noisy speech using temporal analysis on outputs from a
nonlinear cochlear model," IEEE Transactions on Biomedical Engineering, Vol.
40, No. 5, 1993, pp. 456--467.
- L. Deng and K. Erler. "Structural design of a hidden
Markov model based speech recognizer using multi-valued
phonetic features: Comparison with segmental speech units,"
Journal of the Acoustical
Society of America, Vol.92, No.6, December, 1992, pp.3058-3067.
- L. Deng. "A generalized hidden Markov model with
state-conditioned trend functions of time for the speech
signal," Signal
Processing, Vol.27, No.1, April 1992, pp. 65-78.
- L. Deng, P. Kenny, M. Lennig, and P. Mermelstein.
"Modeling acoustic transitions in speech by
state-interpolation hidden Markov models," IEEE Transactions on Signal Processing, Vol.40, No.2,
February, 1992, pp. 265-272.
- L. Deng. "Processing of acoustic signals in a cochlear model
incorporating laterally coupled suppressive elements,"
Neural
Networks, Vol.5, No.1, January 1992, pp.19-34.
- L. Deng. "Hierarchical non-stationarity in a class of
doubly stochastic time series models with application to speech recognition,"
(invited paper). Canadian Acoustics, Vol. 19, No. 4, September,
1991, pp. 113--115.
- L. Deng, P. Kenny, M. Lennig, V. Gupta, F. Seitz and
P. Mermelstein. "Phonemic hidden Markov models with
continuous mixture output densities for large vocabulary
word recognition," IEEE Transactions on Signal
Processing, Vol. 39, No. 7, July, 1991, pp. 1677--1681.
- L. Deng. "Non-parametric estimation of phase variance in
auditory-nerve fiber' s responses to tonal stimuli,"
Journal of the Acoustical Society of America,
Vol.90, No.6, December 1991, pp. 3099--3106.
- L. Deng. "The semi-relaxed algorithm for parameter
estimation of hidden Markov models," Computer Speech and Language,
Vol. 5, No.3, August, 1991, pp. 231--236.
- L. Deng, M. Lennig, F. Seitz and P. Mermelstein.
"Large vocabulary word recognition using context-dependent
allophonic hidden Markov models," Computer Speech and Language, Vol.4, No.4,
December, 1990, pp. 345-357.
- P. Seitz, V. Gupta, M. Lennig, P. Kenny, L. Deng, D. O'Shaughnessy, and P.
Mermelstein. "A dictionary for a very large vocabulary word
recognition system," Computer Speech and Language, Vol. 4, No.2, 1990, pp.
193-202.
- L. Deng, M. Lennig, and P. Mermelstein. "Modeling microsegments
of stop consonants in a hidden Markov model based word
recognizer," Journal of the Acoustical Society of America, Vol.
87, June, 1990, pp. 2738-2747.
- L. Deng, M. Lennig, and P. Mermelstein. "Use of vowel
duration information in a large vocabulary word recognizer,"
Journal of
the Acoustical Society of America, Vol. 86, August, 1989, pp.
540-548.
- L. Deng, C.D. Geisler, and S. Greenberg. "A
composite model of the auditory periphery for the processing of speech,"
(invited paper). Journal of Phonetics, special theme issue on Representation of
Speech in the Auditory Periphery, Vol. 16, No. 1, January, 1988,
pp. 93-108.
- L. Deng and C.D. Geisler. "Responses of auditory-nerve
fibers to nasal consonant-vowel syllables," Journal of the
Acoustical Society of America, Vol. 82, No. 6, December 1987, pp.
1977--1988.
- L. Deng, C.D. Geisler, and S. Greenberg. "Responses of
auditory-nerve fibers to multiple-tone complexes," Journal of the
Acoustical Society of America, Vol. 82, No. 6, December 1987, pp.
1989--2000.
- L. Deng and C.D. Geisler. "A composite auditory model for
processing speech sounds," Journal of the Acoustical Society of
America, Vol. 82, No. 6, December 1987, pp. 2001--2012.
- S.R. Greenberg, C.D. Geisler, and L. Deng. "Frequency
selectivity of single cochlear-nerve fibers based on the
temporal response pattern of two-tone signals," Journal of the Acoustical Society of America, Vol. 79,
No. 4, April 1986, pp. 10 10--1019.
- L. Deng and C.D. Geisler. "Changes in the phase of excitor-tone
responses in auditory-nerve fibers by suppressor tones,"
Journal of the Acoustical Society
of America, Vol. 78, No. 11, November 1985, p p. 1633--1644.
- C.D. Geisler and L. Deng. "Thresholds for primary auditory
fibers using statistically defined criteria," Journal of the Acoustical Society of America,
Vol. 77, No. 3, March 1985, pp. 1102--1109.
Recent Refereed Conference Publications:
- Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan Gong,
Alex Acero. "ADAPTATION OF COMPRESSED HMM PARAMETERS FOR
RESOURCE-CONSTRAINED SPEECH RECOGNITION," Proceedings of the IEEE
International Conference on Acoustics, Speech, and Signal
Processing, March 31-April 5, 2008, Las Vegas.
-
Tsung-Hui Chang, Zhi-Quan Luo, Li Deng, Chong-Yung
Chi. "A Convex Optimization Method for Joint Mean and Variance
Parameter Estimation of Large-Margin CDHMM,"Proceedings of the IEEE
International Conference on Acoustics, Speech, and Signal
Processing, March 31-April 5, 2008, Las Vegas, pp.
-
Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong,
Alex Acero. "MINIMUM-MEAN-SQUARE-ERROR NOISE REDUCTION ALGORITHM ON
MEL-FREQUENCY CEPSTRA FOR ROBUST SPEECH RECOGNITION," Proceedings of
the IEEE International Conference on Acoustics, Speech, and Signal
Processing, March 31-April 5, 2008, Las Vegas, pp.
- Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex
Acero. "HMM ADAPTATION USING A PHASE-SENSITIVE ACOUSTIC DISTORTION
MODEL FOR ENVIRONMENT-ROBUST SPEECH RECOGNITION," Proceedings of the
IEEE International Conference on Acoustics, Speech, and Signal
Processing, March 31-April 5, 2008, Las Vegas, pp.
- Li Deng (invited). ``Roles of high-fidelity acoustic modeling in
robust speech recognition,''
Proc. IEEE Workshop on Automatic Speech Recognition and
Understanding, Kyoto, Japan, Dec 9-13, 2007, 12 pages. - Jinyu Li,
Li Deng, Dong Yu, Yifan Gong, Alex Acere. ``HIGH-PERFORMANCE HMM
ADAPTATION WITH JOINT COMPENSATION OF ADDITIVE AND CONVOLUTIVE
DISTORTIONS VIA VECTOR TAYLOR SERIES'', Proc. IEEE Workshop on
Automatic Speech Recognition and Understanding, Kyoto, Japan, Dec
9-13, 2007, 6 pages.
- Jinyu Li, Li Deng, Dong Yu, Yifan
Gong, Alex Acero. "HIGH-PERFORMANCE HMM ADAPTATION WITH JOINT
COMPENSATION OF ADDITIVE AND CONVOLUTIVE DISTORTIONS VIA VECTOR
TAYLOR SERIES", Proc. IEEE Workshop on Automatic Speech Recognition
and Understanding, Kyoto, Japan, Dec 9-13, 2007.
-
D. Yu and L. Deng
(invited). "Large-Margin Discriminative Training of Hidden Markov
Models for Speech Recognition," Proc. IEEE International Conference
on Semantic Computing, Irvine, CA, September 17-19, 2007.
-
R. Togneri and L. Deng. "A Structured Speech Model
Parameterized by Recursive Dynamics and Neural Networks,"
Proceedings of Interspeech, Antweerp, Belgium, Aug. 27-31, 2007. pp.
894-897.
- D. Yu, L. Deng, and A. Acero. "Handling Phonetic Context and
Speaker Variation in a Structure-Based Speech Recognizer,"
Proceedings of Interspeech, Antweerp, Belgium, Aug. 27-31, 2007,
pp. 906-909.
- L. Deng and H. Strik. "Structure-Based and Template-Based
Automatic Speech Recognition --- Comparing parametric and
non-parametric approaches," Proceedings of Interspeech, Antweerp,
Belgium, Aug. 27-31, 2007, pp. 894-897.
- Q. Fu, Xiaodong He, and L. Deng. "Phone-Discriminating Minimum
Classification Error (P-MCE) Training for Phonetic Recognition,"
Proceedings of Interspeech, Antweerp, Belgium, Aug. 27-31, 2007, pp.
2073-2076.
- Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, and Alex Acero, "A
DISCRIMINATIVE TRAINING FRAMEWORK USING N-BEST SPEECH RECOGNITION
TRANSCRIPTIONS AND SCORES FOR SPOKEN UTTERANCE CLASSIFICATION,"
Proceedings of the IEEE International Conference on Acoustics,
Speech, and Signal Processing, Honolulu, Hawaii, April 2007
- Li
Deng and Dong Yu, "Use of Differential Cepstra as Acoustic Features
in Hidden Trajectory Modeling for Phonetic Recognition," Proceedings
of the IEEE International Conference on Acoustics, Speech, and
Signal Processing, Honolulu, Hawaii, April 2007
- Dong Yu, Li Deng, Xiaodong He, Alex Acero, "LARGE-MARGIN MINIMUM
CLASSIFICATION ERROR TRAINING FOR LARGE-SCALE SPEECH RECOGNITION
TASKS," Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing, Honolulu, Hawaii, April
2007
- Xiaolong Li, Yuncheng Ju, Li Deng, Alex Acero, "EFFICIENT AND
ROBUST LANGUAGE MODELING IN AN AUTOMATIC CHILDREN’S READING TUTOR
SYSTEM," Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing, Honolulu, Hawaii, April
2007
- D. Yu, L. Deng, X. He, and A. Acero."Use of incrementally
regulated discriminative margins in MCE training for speech
recognition," Proceedings of Interspeech, Pittsburgh, PA,
Sept 2006, pp. 2418-2421
- Xiaolong Li, L. Deng, and A. Acero. "Time synchronous decoding
for a Long-Contextual-Span Hidden Trajectory Model of Speech,"
Proceedings of Interspeech, Pittsburgh, PA, Sept 2006, pp.
609-612.
- Xiaodong He, L. Deng, and W. Chou."A novel learning
method for hidden Markov models in speech and audio
processing," Proc. IEEE Workshop on Multimedia Signal
Processing, Victoria, BC, October 2006, 6 pages. CDROM.
- Li Deng, Xiaodong Cui, Robert Pruvenok, Jonathan Huang,
Safiyy Momen, Yanyi Chen, and Abeer Alwan. "A database of vocal
tract resonance trajectories for research in speech
processing," Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, May
14-19, 2006, Toulouse, France, pp. 60-63.
- L. Deng, D. Yu, and A. Acero."A generative modeling framework
for structured hidden speech dynamics," Neural
Information Processing System (NIPS) Workshop, Whistler, BC,
Canada, Dec. 2005.
- L. Deng, D. Yu, and A. Acero."A Long-Contextual-Span Model
of Resonance Dynamics for Speech Recognition: Parameter Learning
and Recognizer Evaluation," IEEE Workshop on ASRU, Nov.
27-Dec 1, 2005, 6 pages (CDROM).
- D. Yu, L. Deng, and A. Acero. "A* Lattice Search Algorithm
for a Long-Contextual-Span Hidden Trajectory Model and Phonetic
Recognizer," Proceedings of Interspeech, Lisbon, Sept 2005, pp.
553-556.
- L. Deng, D. Yu, and A. Acero."Learning Statistically
Characterized Resonance Targets in a Hidden Trajectory Model of
Speech Coarticulation and Reduction," Proceedings of
Interspeech, Lisbon, Sept 2005, pp. 1097-1100.
- A. Subramanya, L. Deng, Z. Liu, and Z. Zhang.
"Multi-sensory speech processing: Incorporating automatically
extracted hidden dynamic information," Proceedings of the
IEEE International Conference on Multimedia & Expo (ICME), July
2005, Amsterdam, 4 pages.
- L. Deng, X. Li, D. Yu, and A. Acero."A hidden trajectory
model with bidirectional target filtering: Cascaded vs.
Integrated implementation for phonetic recognition,"
Proceedings of the IEEE International Conference on Acoustics,
Speech, and Signal Processing, March 19-23, 2005, Philadelphia,
PA, pp 337-340.
- L. Deng, X. Li, D. Yu, and, A. Acero. "Novel Acoustic Modeling
with Structured Dynamics for Speech Coarticulation and
Reduction," Proc. of DARPA/NIST RT-04 Workshop, Palisades, New
York, Nov. 7-10, 2004, 6 pages.
- D. Yu, M. Hwang, P. Mau, A. Acero, and L. Deng. "Unsupervised
learning from users’error correction in speech dictation,"
Proceedings of the International Conference on Spoken Language
Processing, Oct.4-8, 2004, Jeju Island, Korea, No. Spec4201o.1,
pp. 4201-4204.
- L. Deng, D. Yu, and A. Acero. "A quantitative model for formant
dynamics and contextually assimilated reduction in fluent
speech," Proceedings of the International Conference on Spoken
Language Processing, Oct.4-8, 2004, Jeju Island, Korea, No.
WeA501p.20, pp.\ 501-504.
- R. Togneri and L. Deng. "Use of neural network mapping and
extended Kalman filter to recover vocal tract resonances from
the MFCC parameters of speech," Proceedings of the
International Conference on Spoken Language Processing, Oct.4-8,
2004, Jeju Island, Korea, No.WeB1201o.4, pp. 1201-1204.
- L. Deng, Z. Liu, Z. Zhang, and A. Acero. "Information fusion for
multi-sensor processing --- Extracting and exploiting hidden
dynamics of speech captured by a bone-conductive microphone,"
Proceedings of the IEEE Fifth Workshop on Multimedia Signal
Processing, Siena, Italy, Sept 28-Oct 2, 2004, 4 pages.
- L. Deng, L. Lee, H. Attias, and A. Acero. "A structured speech
model with continuous hidden dynamics and prediction-residual
training for tracking vocal tract resonances," Proceedings of
the IEEE International Conference on Acoustics, Speech, and
Signal Processing, Montreal, Canada, May 2004, Vol. I,
pp.557-560.
- L. Lee, L. Deng, and H. Attias."A multimodal variational
approach to learning and inference in switching state space
models," Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing, Montreal, Canada, May
2004, Vol. V, pp.505-508.
- Z. Zhang, Z. Liu, M. Sinclair, A. Acero, L. Deng, J. Droppo. X.
Huang, Y. Zheng. "Multisensory microphones for robust speech
detection, enhancement, and recognition," Proceedings of the
IEEE International Conference on Acoustics, Speech, and Signal
Processing, Montreal, Canada, May 2004, Vol. III,
pp.781-784.
- Y. Zheng, Z. Liu, Z. Zhang, M. Sinclair, J. Droppo, L.
Deng, A. Acero, and X Huang. "Air- and bone-conductive
integrated microphones for robust speech detection and
enhancement," Proceedings of the IEEE Workshop on Automatic
Speech Recognition and Understanding, Nov. 30--Dec. 4, 2003, St.
Thomas, US Virgin Islands. 6 pages in CDROM.
- J. Wu, J. Droppo, L. Deng, and A. Acero. "A noise-robust ASR
frontend using Wiener filters constructed from MMSE estimates of
clean speech and noise," Proceedings of the IEEE Workshop on
Automatic Speech Recognition and Understanding, Nov. 30--Dec.
4, 2003, St. Thomas, US Virgin Islands. 6 pages in CDROM.
- L. Deng, I. Bazzi, and A. Acero. "Tracking vocal tract
resonances using an analytical nonlinear predictor and a
target-guided temporal constraint," Proceedings of the European
Conference on Speech Communication and Technology, Geneva,
Switzerland, September 2003, Vol.I, pp. 73-76.
- J. Droppo, L. Deng, and A. Acero. "A comparison of three
non-linear observation models for noisy speech features,"
Proceedings of the European Conference on Speech Communication
and Technology, Geneva, Switzerland, September 2003, Vol.
II, pp. 681-684.
- L. Deng, J. Droppo, and A. Acero. "Incremental Bayes learning
with prior evolution for tracking nonstationary noise statistics
from noisy speech data," Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, Hong
Kong, April 2003, Vol.I, pp. 672-675.
- F. Seide, J.L. Zhou, and L. Deng. "Coarticulation modeling by
embedding a target-directed hidden trajectory model into HMM ---
MAP decoding and evaluation," Proceedings of the IEEE
International Conference on Acoustics, Speech, and Signal
Processing, Hong Kong, April 2003, Vol.I, pp. 748-751.
- J.L. Zhou, F. Seide, and L. Deng. "Coarticulation modeling by
embedding a target-directed hidden trajectory model into HMM ---
Models and training," Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, Hong
Kong, April 2003, Vol.I, pp. 744-747.
- L.J. Lee, H. Attias, and L. Deng. "Variational inference and
learning for segmental switching state space models of hidden
speech dynamics," Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, Hong
Kong, April 2003, Vol.I, pp. 920-923.
- I. Bazzi, A. Acero, and L. Deng. "An expectation-maximization
approach for formant tracking using a parameter-free non-linear
predictor," Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing, Hong Kong, April 2003,
Vol.I, pp. 464-467.
- L. Deng, A. Acero, Y. Wang, K. Wang, H. Hon, J. Droppo, M.
Mahajan, and XD Huang. "A speech-centric perspective for
human-computer interface," (invited). Proceedings of the IEEE
Fifth Workshop on Multimedia Signal Processing, Dec. 9-11, 2002,
St. Thomas, US Virgin Islands,
5 pages in
CDROM.
- H. Attias and L. Deng. "A new approach to speech enhancement by
a microphone array using EM and mixture moels,"
Proceedings of the International Conference on Spoken Language
Processing, Denver CO, September 2002, pp. 151-154.
- J. Droppo, A. Acero, and L. Deng. "Evaluation of SPLICE
on the Aurora2 and Aurora3 tasks," Proceedings of the
International Conference on Spoken Language Processing, Denver
CO, September 2002, pp. 121-124.
- L. Deng, J. Droppo, and A. Acero. "Log-domain speech feature
enhancement using sequential MAP noise estimation
and a phase-sensitive model of the acoustic environment,"
Proceedings of the International Conference on Spoken Language
Processing, Denver CO, September 2002, pp. 192-195.
- L. Deng, J. Droppo, and A. Acero. "Exploiting variances
in robust feature extraction based on a parametric model of
speech distortion," Proceedings of the International Conference
on Spoken Language Processing, Denver CO, September 2002, pp.
217-220.
- J. Droppo, A. Acero, and L. Deng. "A nonlinear
observation model for removing noise from corrupted speech log mel-spectral energies," Proceedings of the International
Conference on Spoken Language Processing, Denver CO, September
2002, pp. 182-185.
- L. Deng, J. Droppo, and A. Acero. "A Bayesian approach to
speech feature enhancement using the dynamic cepstral prior,"
Proceedings of the IEEE International Conference on Acoustics,
Speech, and Signal Processing, Vol.I, Orlando, Florida, May
2002, pp. 829-832.
- J. Droppo, A. Acero, and L. Deng. "Uncertainty decoding
with SPLICE for noise robust speech recognition," Proceedings
of the IEEE International Conference on Acoustics, Speech, and
Signal Processing, Vol.I, Orlando, Florida, May 2002, pp. 57-60.
- J. Ma and L. Deng. "A mixture linear model with
target-directed dynamics for spontaneous speech recognition,"
Proceedings of the IEEE International Conference on Acoustics,
Speech, and Signal Processing, Vol.I, Orlando, Florida, May
2002, pp. 961-964.
- L. Deng, J. Droppo, and A. Acero. "Recursive estimation
of nonstationary noise using a nonlinear model with iterative
stochastic approximation," Proceedings of Automtic Speech
Recognition and Understanding Workshop, Madonna di Campiglio,
Trento, Italy, Dec. 9-13, 2001. 4 pages (CDROM).
- T. Kristjansson, B. Frey, L. Deng, and A. Acero. "Joint
estimation of noise and channel distortion in a generalized EM
framework," Proceedings of Automtic Speech Recognition and
Understanding Workshop, Madonna di Campiglio, Trento, Italy,
Dec. 9-13, 2001. 4 pages (CDROM).
- B. Frey, T. Kristjansson, L. Deng, and A. Acero.
"Learning dynamic noise models from noisy speech for robust
speech recognition," Advances in Neural Information Processing
Systems (NIPS), Vol. 14, Vancouver, Canada, 2001, pp. 101-108.
- J. Droppo, L. Deng, A. Acero. "Evaluation of the SPLICE
algorithm on the Aurora2 database," Proceedings of the European
Conference on Speech Communication and Technology, Vol. 1,
Aalborg, Denmark, September 2001, pp. 217-220.
- H. Attias, L. Deng, A. Acero, and J. Platt. "A new method
for speech denoising and robust speech recognition using
probabilistic models for clean speech and for noise,"
Proceedings of the European Conference on Speech Communication
and Technology, Vol. 2, Aalborg, Denmark, September 2001, pp.
1903-1906.
- B. Frey, L. Deng, A. Acero, and T. Kristjansson.
"ALGONQUIN: Iterating Laplace's method to remove multiple types
of acoustic distortion for robust speech recognition,"
Proceedings of the European Conference on Speech Communication
and Technology, Aalborg, Denmark, September 2001, pp. 901-904.
- J. Ma and L. Deng. "Efficient decoding strategy for
conversational speech recognition using state-space models for
vocal-tract-resonance dynamics", Proceedings of the European
Conference on Speech Communication and Technology, Aalborg,
Denmark, September 2001, pp. 603-606.
- L. Deng, A. Acero, L. Jiang, J. Droppo, and XD Huang.
"High-performance robust speech recognition using stereo
training data," Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, Vol.I,
Salt Lake City, Utah, April 2001, pp. 301-304.
- T. Kristjansson, L. Deng, A. Acero and B. Frey. "Towards
non-stationary model-based noise adaptation for large vocabulary
speech recognition," Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, Vol.I,
Salt Lake City, Utah, April 2001, pp. 337-340.
- X. Huang, A. Acero, C. Chelba, L. Deng, J. Droppo, H. Hon, et al. (invited)
"MIPAD: A next generation PDA prototype," Proceedings of the IEEE
International Conference on Acoustics, Speech, and Signal Processing,
Vol.I, Salt Lake City, Utah, April 2001, pp. 9-12.
- R. Togneri and L. Deng. "An EKF-based algorithm for learning
statistical hidden dynamic model parameters for phonetic
recognition," Proceedings of the IEEE International Conference
on Acoustics, Speech, and Signal Processing, Vol.I, Salt Lake
City, Utah, April 2001, pp. 465-468.
- L. Lee, P. Fieguth, and L. Deng. "A functional articulatory
dynamic model for speech production," Proceedings of the IEEE
International Conference on Acoustics, Speech, and Signal
Processing, Vol.II, Salt Lake City, Utah, April 2001, pp. 797-800.
- J. Droppo, L. Deng, A. Acero. "Efficient on-line acoustic environment
estimation for FCDCN in a continuous speech recognition system,
Proceedings of the IEEE International Conference on Acoustics,
Speech, and Signal Processing, Vol.I, Salt Lake City,
Utah, April 2001, pp. 209-212.
- H. Attias, J. Platt, A. Acero, and L. Deng. "Speech denoising and dereveberation using probabilistic models," Advances in Neural Information Processing Systems (NIPS), Vol. 13, Denver, CO, Nov. 27-Dec 2, 2000, pp. 758-764.
- J. Sun and L. Deng. "Annotation and use of speech production corpus for
building language-universal speech recognizers", Proceedings of the 2nd
International Symposium on Chinese Spoken Language Processing (ISCSLP),
Beijing, October 2000, Vol. 3, pp. 31-34.
- J. Sun, L. Deng, and X. Jing."Data-driven model construction for continuous
speech recognition using overlapping articulatory features," Proceedings
of the International Conference on Spoken Language Processing, October 2000,
Vol. 1, pp. 437-440.
- A. Acero, L. Deng, T. Kristjansson, and J. Zhang. "HMM adaptation using
vector Taylor series for noisy speech recognition," Proceedings of
the International Conference on Spoken Language Processing,October 2000, Vol.
3, pp. 869-872.
- X. Huang, A. Acero, C. Chelba, L. Deng, D. Duchene, J. Goodman, H. Hon, D.
Jacoby, L. Jiang, R. Loynd, M. Mahajan, P. Mau, S. Meredith, S. Mughal, S.
Neto, M. Plumpe, K. Wang, Y. Wang. "MIPAD: A next generation PDA
prototype," Proceedings of the International Conference on Spoken
Language Processing, October 2000, Vol. 3, pp. 33-36.
- L. Deng, A. Acero, M. Plumpe, and X.D. Huang."Large-vocabulary speech
recognition under adverse acoustic environments," Proceedings of the
International Conference on Spoken Language Processing, October 2000, Vol. 3,
pp. 806-809.
- H. Jiang and L. Deng. "A robust training strategy against extraneous
acoustic variations for spontaneous speech recognition," Proceedings of
the International Conference on Spoken Language Processing, October 2000, Vol.
4, pp. 161-164.
- J. Sun, R. Tongneri and L. Deng. "A robust speech understanding system
using conceptual relational grammar," Proceedings of the International
Conference on Spoken Language Processing,October 2000, Vol. 2, pp. 879-882.
- S. Dusan and L. Deng. "Acoustic-to-articulatory inversion using dynamical and
phonological constraints" Proceedings of the 5th Speech Production
Workshop: MODELS AND DATA, Kloster Seeon, Germany, May 1-4, 2000, pp. 237-240.
Book Chapters
-
Li Deng and Jianwu Dang. "Speech Analysis: The
Production-Perception Perspective," Chapter 1 in Hai-Zhou
Li and Chin-Hui Lee (eds.), Advances in Chinese Spoken
Language Processing, Publisher: World Scientific,
New Jersey, 2007, pp. 3-32.
- Dong Yu and Li Deng. "Speech-Centric Multimodal User
Interface Design in Mobile Technology", Chapter
XVIII in
Jo Lumsden (Ed.), Handbook of Research on User Interface
Design and Evaluation for Mobile Technology,
Publisher: IGI Global (Information Science Reference), New
York, 2008.
- L. Deng and H. Sheikhzadeh. "Use of an
Integrated Neural-Network and Cochlear Model for the
Study of Speech Encoding in the Auditory System," in W.
Ainsworth and S. Greenberg (eds.) Listening to
Speech: An Auditory Perspective, Publisher: Lawrence Erlbaum Associates, 2006, pp. 237-256.
- L. Deng. "Switching Dynamic System Models for Speech Articulation and
Acoustics," in M. Johnson, M. Ostendorf, S. Khudanpur, and R. Rosenfeld (eds.) IMA Volume 138: Mathematical Foundations of
Speech and Language Processing, Springer-Verlag, New
York, 2003, pp. 115--134.
-
C. Avendano, L. Deng, H. Hermansky, and B. Gold.
"The Analysis and Representation of Speech," Chapter 2
in S. Greenberg, W. Ainsworth, A. Popper, and R. Fay
(eds.) Speech Processing in the Auditory System,
Springer, New York, 2005.
-
L. Deng. "Articulatory Features and Associated Production Models in
Statistical Speech Recognition," in K. Ponting (ed.)
Computational Models of Speech Pattern Processing, (NATO
ASI Series), Springer, 1999, pp. 214-224.
-
L. Deng. "Computational Models for Speech Production,"
Computational Models of Speech Pattern Processing, (NATO
ASI Series), Springer, 1999, pp. 199-213.
-
L. Deng. "Computational Models for Auditory Speech
Processing,"
Computational Models of Speech Pattern Processing,
(NATO ASI Series), Springer, 1999, pp. 67-77.
-
L. Deng. "A dynamic, feature-based approach to speech modeling and
recognition," in S. Furui, F. Juang and W. Chou (eds.)
Automatic Speech
Recognition and Understanding,
NJ., IEEE (Catalog No. 97TH8241), 1997, pp. 107-114.
- D. Sun and L. Deng. "Nonstationary-State Hidden Markov Models for Speech
Recognition," in S. E. Levinson and L. Shepp, (eds.)
Image and Speech Models
--- Volume 80 in IMA Volumes in Mathematics and its Applications,
Springer-Verlag, New York, 1995, pp. 161--182.
- D. Zhang, L. Deng, and M. Elmasry. "Pipelined Neural Network
Architecture For Speech Recognition," Chapter 9 in M.I. Elmasry, (ed.)
VLSI
Artificial Neural Networks Engineering, Kluwer Academic
Publishers, 1994, pp. 297-315.
- K. Hassanein, L. Deng, and M. Elmasry. "Neural
Predictive Hidden Markov Model Architecture For Speech And Speaker
Recognition," Chapter 10 in M.I. Elmasry, (ed.) VLSI Artificial Neural
Networks Engineering, Kluwer Academic Publishers, 1994, pp.
316-336.
- L. Deng, K. Hassanein, and M. Elmasry.
"Neural-Network Architecture For Linear And Nonlinear Predictive Hidden Markov
Models: Application To Speech Recognition," in B. H. Juang, S. Y. Kung, and C.
A. Kamm, (eds.) Neural Networks for Signal Processing
, Princeton, NJ, IEEE (Catalog No. 91TH0385), 1991, pp. 411--421.
- L. Deng. "Interfacing Displacement Sensors --- Linear Variable Differential
Transformers," Chapter 9 in W. Tompkins and J. Webster, (eds.)
Interfacing
Sensors to the IBM PC, Prentice-Hall Inc., Englewood Cliffs, New
Jersey, 1988, pp. 250-301.
Selected Publications (prior to 1999 joining MS)
-
M. Naito, L. Deng, and Y. Sagisaka. "Speaker clustering
for speech recognition using the parameters
characterizing vocal tract dimensions," Proceedings of the
IEEE International Conference on Acoustics, Speech, and Signal Processing,
Seattle, WA, May 11-15, 1998.
-
M. Naito, L. Deng, and Y. Sagisaka. "Speaker adaptation methods using vocal
tract parameters," (in Japanese) Proceedings of the 1998 Spring Meeting of the
Acoustical Society of Japan,
Yokohama, Japan, March 17-19, 1998, pp. 55-56.
-
M. Naito, L. Deng, and Y. Sagisaka. "A study on speaker clustering methods
using vocal tract parameters," (in Japanese) Proceedings of Japan Institute of
Electronics, Information, and Communication Engineers (IEICE),
Yokosuka, Japan, December 1997, Vol. 97, No. 441, pp. 35-40.
-
L. Deng (invited). "A dynamic, feature-based approach to
speech modeling and recognition," Proceedings of the 1997 IEEE Workshop on Automatic Speech
Recognition and Understanding,
Santa Barbara, CA, December 14-17, 1997, pp. 107-114.
-
C. Rathinavelu and L. Deng. "Speech adaptation
experiments using nonstationary-state HMMs: A MAP
approach," Proceedings of the IEEE
International Conference on Acoustics, Speech, and Signal Processing,
Munich, Germany, 1997, Vol. 2, pp. 1415-1418.
-
L. Deng. "Integrated-multilingual speech recognition
using universal phonological features in a functional
speech production model," Proceedings of
the IEEE International Conference on Acoustics, Speech, and Signal Processing,
Munich, Germany, 1997, Vol. 2, pp. 1007-1010.
-
C. Rathinavelu and L. Deng. "On the use of
discriminatively derived feature space transformation in
speech recognition," Proceedings of the
International Conference on Signal Processing Applications and Technology,
Boston, MA, October 7-10, 1996, pp. 1769-1773.
-
C. Rathinavelu and L. Deng. "Trended HMM with
discriminative training for phonetic classification," Proceedings of the International Conference on
Spoken Language Processing, Philadelphia, PA, October 3-6, 1996,
pp. 1049-1052.
-
X. Shen, L. Deng, and A. Yasmin. "H_inf filtering
for speech enhancement," Proceedings of the International Conference on Spoken Language
Processing, Philadelphia, PA, October 3-6, 1996, pp. 873-876.
-
L. Deng, X. Shen, and D. Jamieson. "Simulation of
disordered speech using a frequency-domain vocal tract
model," Proceedings of the International
Conference on Spoken Language Processing, Philadelphia, PA, October
3-6, 1996, pp. 768-771.
-
D. Jamieson, L. Deng, M. Price, V. Parsa, and J. Till.
"Interactions of speech disorders with speech coders:
Effects on speech intelligibility," Proceedings
of the International Conference on Spoken Language Processing,
Philadelphia, PA, October 3-6, 1996, pp. 737-740.
-
G. Ramsay and L. Deng. "Optimal filtering and
smoothing for speech recognition using a stochastic
target model," Proceedings of the International
Conference on Spoken Language Processing, Philadelphia, PA, October
3-6, 1996, pp. 1113-1116.
-
L. Deng and J. Wu. "Hierarchical partitioning of
articulatory state space for articulatory-feature based
speech recognition," Proceedings of the
International Conference on Spoken Language Processing,
Philadelphia, PA, October 3-6, 1996, pp. 2266-2269.
-
J. Wu and L. Deng. "Acoustic Modeling for Continuous
Mandarin-Chinese Speech Recognition," Proceedings of the International Conference on Spoken Language
Processing, Philadelphia, PA, October 3-6, 1996, pp. 2281-2284.
-
L. Deng, G. Ramsay, and D. Sun. (invited).
"Production models as a structural basis for automatic
speech recognition," Proceedings of the Fourth
European Speech Production Workshop, Autrans, France, May 24-27,
1996, pp. 69--80.
-
L. Deng. "Finite-state automata derived from overlapping
articulatory features: A novel phonological construct
for speech recognition," Proceedings
of the Workshop on Computational Phonology in Speech Technology,
(published by Association for Computational Linguistics), Santa Cruz, CA, June
28, 1996. pp. 37-45.
-
L. Deng and H. Sheikhzadeh. "Temporal and rate aspects of speech encoding in
the auditory system: Simulation results on TIMIT data
using a layered neural network interfaced with a
cochlear model," Proceedings of European Speech
Communication Association Tutorial and Research Workshop on the Auditory Basis
of Speech Recognition,
July 15 - 19, 1996, Keele University, United Kingdom, pp. 75-78.
-
C. Rathinavelu and L. Deng. "HMM-based speech recognition using
state-dependent, discriminatively derived transforms on Mel-warped DFT
features", Proceedings of the IEEE International Conference on Acoustics,
Speech, and Signal Processing, Vol.1, Atlanta, Georgia, May 7-10,
1996, pp. 9--12.
-
L. Deng, G. Ramsay, and H. Sameti. "From
modeling surface phenomena to modeling mechanisms:
Towards a faithful model of the speech process aiming at
speech recognition," Proceedings of the 1995 IEEE Workshop on
Automatic Speech Recognition,
December 10-13, 1995, Snowbird, Utah, pp. 183-184.
-
G. Ramsay and L. Deng. "Maximum-likelihood
estimation for articulatory speech recognition using a
stochastic target model," Proceedings of the 1995
European Conference on Speech Communication and Technology,
Spain, September 18-21, 1995, pp. 1401-1404.
-
G. Ramsay and L. Deng. "Modal analysis of
acoustic wave propagation in the vocal tract using a
finite-difference method," Proceedings of the XII
International Congress of Phonetic Sciences, Stockholm, Sweden,
August 13-19, 1995, Vol 2, pp. 338-341.
-
G. Ramsay and L. Deng. "Articulatory synthesis
using a stochastic target model of speech production," Proceedings of the XII International
Congress of Phonetic Sciences, Stockholm, Sweden, August 13-19,
1995, Vol 2, pp. 478-481.
-
L. Deng, J. Wu, and H. Sameti. "Improved speech modeling
and recognition using multi-dimensional articulatory
states as primitive speech units," Proceedings
of the IEEE International Conference on Acoustics, Speech, and Signal
Processing, Detroit, MI, May 8-12, 1995, pp. 385-388.
-
D. Sun and L. Deng. "Analysis of acoustic-phonetic variations in fluent speech
using TIMIT," Proceedings of the IEEE International Conference on Acoustics,
Speech, and Signal Processing, Detroit, MI, May 8-12, 1995, pp.
201-204.
-
C. Rathinavelu and L. Deng. "Use of generalized
dynamic feature parameters for speech recognition:
Maximum likelihood and minimum classification error
approaches," Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, Detroit,
MI, May 8-12, 1995, pp. 373-376.
-
S. Shen and L. Deng. "Discrete H_inf filtering design with application to
speech enhance ment," Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing, Detroit, MI, May 8-12,
1995, pp. 1504-1507.
-
H. Sheikhzadeh, R. Brennan, L. Deng, and H. Sameti,
"Real-time
implementation of HMM-based MMSE algorithm for speech
enhancement in hearing aid applications," Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing, 1995
,
-
D. Sun and L. Deng. "Nonstationary-state hidden Markov
model with state-dependent time warping: Application to
speech recognition," Proceedings
of the 1994 International Conference on Spoken Language Processing,
Vol. 1, Yokohama, Japan, September, 18-22, 1994. pp. 243--246,
-
L. Deng and H. Sameti. "Speech recognition using
dynamically defined speech units," Proceedings of the 1994 International Conference on Spoken
Language Processing, Vol. 4, pp. 2167--2170, Yokohama, Japan,
September, 18-22, 1994.
-
H. Sheikhzadeh and L. Deng. "Interval statistics
from a cochlear model in response to speech sounds," Journal of the Acoustical Society of America,
Vol. 95, No. 6, June 1994 (Abstract), pp. 2842. (The 127th Meeting of the
Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
-
L. Deng and I. Kheirallah. "Stability analysis on
finite-difference solution of a basilar-membrane
vibration model with application to acoustic signal
processing," Journal of the Acoustical Society of America,
Vol. 95, No. 6, June 1994 (Abstract), pp. 2840. (The 127th Meeting of the
Acoustical Society of America, June 4-8, 1994, Cambridge, MA.)
-
L. Deng and H. Sameti. "Articulatory phonology
and speech recognition: A study on use of dynamically
defined speech primitives," Journal of the
Acoustical Society of America, Vol. 95, No. 6, June 1994
(Abstract), pp. 2870. (The 127th Meeting of the Acoustical Society of America,
June 4-8, 1994, Cambridge, MA.)
-
G. Ramsay and L. Deng. "A stochastic framework
for articulatory speech recognition," Journal of the Acoustical Society of America, Vol.
95, No. 6, June 1994 (Abstract), pp. 2871. (The 127th Meeting of the Acoustical
Society of America, June 4-8, 1994, Cambridge, MA.)
-
K. Hassanein, L. Deng and M. Elmasry. "A neural
predictive hidden Markov model for speaker recognition," Proceedings of the Workshop on Automatic
Speaker Recognition, Identification and Verification,
Martigny, Switzerland, April, 1994, pp. 115-118.
-
L. Deng and M. Aksmanovic. "HMMs with mixtures of
trended functions for automatic speech recognition," IEEE International Conference on Speech, Image
Processing and Neural Networks, April 13-15, 1994, HongKong, pp.
702-705.
-
L. Deng. "A theory on optimal construction of dynamic
features for hidden Markov modeling of speech," IEEE International Conference on Speech, Image
Processing and Neural Networks, April 13-15, 1994, HongKong, pp.
351-354.
-
L. Deng and D. Sun. "Phonetic classification and
recognition using HMM representation of overlapping
articulatory features for all classes of English
sounds," Proceedings of the IEEE International Conference on Acoustics, Speech,
and Signal Processing, Adelaide, Australia, April 19-22, 1994, Vol.
1, pp. 45-48.
-
K. Hassanein, L. Deng and M. Elmasry. "Vowel
classification using a neural predictive HMM: A
discriminative training approach," Proceedings of the
IEEE International Conference on Acoustics, Speech, and Signal Processing,
Adelaide, Australia, April 19-22, 1994, Vol 2, pp. 665-668.
-
H. Sameti, H. Sheikhzadeh, L. Deng and R. Brennan.
"Comparative performance of spectral subtraction and
HMM-based speech enhancement strategies with application
to hearing aid design." Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, Adelaide,
Australia, April 19-22, 1994, Vol. 1, pp. 13-16.
-
L. Deng. "A computational model of phonology-phonetics
integration for automatic speech recognition," Proceedings of the 1993 IEEE Workshop on
Automatic Speech Recognition,
December 12-15, 1993, Snowbird, Utah, pp. 83--84.
-
K. Hassanein, L. Deng and M. Elmasry. "A neural
predictive hidden Markov model for speech and speaker
recognition," Proceedings of the Fifth
International Conference on Microelectronics
December 14-16, 1993, Dhahran, Saudi Arabia, pp. 108-111.
-
L. Deng and D. Sun. "Speech recognition using the atomic
speech units constructed from overlapping articulatory
features," Proceedings of the 1993
European Conference on Speech Communication and Technology,
September 21-23, 1993, Berlin, Germany, Vol. III, pp. 1635--1638.
-
D. Zhang, L. Deng, and M. Elmasry. "Pipelined
neural network architecture for speech recognition," Proceedings of the 1993 World Congress on
Neural Networks,
July 11-15, 1993, Portland, Oregon, Vol. III, pp. 55-58.
-
L. Deng. "Design of a feature-based speech recognizer aiming at integration of
auditory processing, signal modeling, and phonological structure of speech."
(invited) Journal of the Acoustical Society of America, Vol. 93,
No.4, Pt. 2, pp. 2318, April, 1993.
-
K. Hassanein, L. Deng, and M.
Elmasry. "Maximal mutual information training of a
neural predictive HMM speech recognition system," Proceedings of the 1992 IEEE Workshop on Neural
Networks for Signal Processing,
August 31--September 2, 1992, Copenhagen, Denmark, pp. 164-173.
-
K. Erler and L. Deng. "HMM
representation of quantized articulatory features for recognition of highly confusible
words," Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing, San Francisco, CA.,
March, 1992, pp.545-548.
-
L. Deng. "Speech modeling and recognition using a time
series model containing trend functions with Markov
modulated parameters," Proceedings of the 1991 IEEE Workshop on Automatic
Speech Recognition,
Arden House, New York, December, 1991, pp. 24-26.
-
L. Deng and K. Erler. "Microstructural speech
units and their HMM representation for discrete
utterance speech recognition," Proceedings of the IEEE International Conference on Acoustics,
Speech, and Signal Processing, Toronto, Ontario, Canada, May, 1991,
pp. 193--196. P. Seitz, V. Gupta, M. Lennig, P. Kenny, L. Deng, D.
O'Shaughnessy, and P. Mermelstein. "Phonological rule
set complexity as a factor in the performance of a very
large vocabulary word recognition system,"
Journal of the Acoustical Society of America, 87(1), May, 1990,
S108 (Abstract).
-
L. Deng, V. Gupta, M. Lennig, P. Kenny, and P.
Mermelstein. "Acoustic recognition component of an
86,000-word speech recognizer," Proceedings of the IEEE International Conference on Acoustics,
Speech, and Signal Processing, Albuquerque, New Mexico, 1990, pp.
741--744.
-
L. Deng, P. Kenny, M. Lennig, V. Gupta and P.
Mermelstein. "A locus model of coarticulation in a
hidden-Markov-model-based speech recognizer," Proceedings of the IEEE International Conference on
Acoustics, Speech, and Signal Processing, Glascow, Scotland, 1989,
pp. 97-100.
-
L. Deng, P. Kenny, M. Lennig, V. Gupta
and P. Mermelstein. "Large vocabulary word recognition based on phonetic
representation by hidden Markov models", Proceedings of the Canadian
Conference on Electrical and Computer Engineering, Vancouver,
Canada, November 1988, pp. 131-134.
-
L. Deng, M. Lennig, and P. Mermelstein.
"Modeling acoustic-phonetic detail in a
hidden-Markov-model-based large vocabulary speech
recognizer," Proceedings of the IEEE International
Conference on Acoustics, Speech, and Signal Processing, New York,
New York, Vol. 1, April 1988, pp. 509--512.
Patents (awarded)
- Removing noise from
feature vectors, U.S. Patent No.: 7,310,599; Granted on December 18,
2007;
- Method of determining uncertainty associated
with acoustic distortion-based noise reduction, U.S. Patent No.
7,289,955; Granted on October 30, 2007
- Method and apparatus for identifying noise
environments from noisy signals, U.S. Patent No. 7,266,494; Granted
on September 4, 2007
- Method of noisy reduction using correction
and scaling vectors with partitioning of the acoustic space in the
domain of noisy speech, U.S. Patent No.7,254,536; Granted on August
7, 2007
- Method of determining uncertainty in noise reduction, US and International
Patents; U.S. Patent No.: 7,174,292; Granted on Feb. 6, 2007
- Method of Noise Estimation Using Incremental
Bayes Learning, US. Patent; Patent No.: 7,165,026; Granted on Jan.
16, 2007
- Method of iterative noise estimation in a
recursive framework, U.S. Patent; Patent No. 7,139,703; Granted on
Nov. 21, 2006.
- Method of noise reduction using correction
vectors based on dynamic aspects of speech and noise normalization,
United States Patent No. 7,117,148; Granted on October 3, 2006.
- Method of noise reduction based on dynamic
aspects of speech, United States Patent No. 7,107,210; Granted on
Sept 12, 2006.
- Method of pattern recognition using noise
reduction uncertainty, United States Patent No. 7,103,540; Granted
on Sept 5, 2006.
- Microphone array signal enhancement using mixture
models (jointly with Hagai Attias), United States Patent No.
7,103,541; Granted on Sept 5, 2006.
- Efficient backward recursion for computing
posterior probabilities, United States Patent No. 7,062,407; Granted
on June 13, 2006.
- Method of speech recognition using
time-dependent interpolation and hidden dynamics, United States
(and International) Patent No. 7,050,975; Granted on May 23, 2006.
- Nonlinear observation models for removing
noise from corrupted speech, United States (and International)
Patent No. 7,047,047; Granted on May 16, 2006.
- Method of Noise Reduction Using Correction
and Scaling Vectors with Partitioning of the Acoustic Space in the
Domain of Noisy Speech, United States Patent No. 7,003,455; Granted
on February 21, 2006
- Methods and Apparatus for Denoising and
Dereverberation Using Variational Inference and Strong Speech
Models, United States Patent No. 6,990,447; Granted on January 24,
2006
- Method and Apparatus for Removing Noise from
Feature Vectors, United States Patent No. 6,985,858; Granted on
January 10, 2006
- Methods for Including the Category of
Environmental Noise When Processing Speech Signals, United States
Patent No. 6,959,276; Granted on October 25, 2005
- Method of iterative noise estimation in a
recursive framework, United States Patent; Patent No. 6,944,590;
Granted on September 13, 2005
- Method of speech recognition using
variational inference with switching state space models, United
States Patent; Patent No. 6,931,374; Granted on August 16, 2005
- Pattern Recognition Training Method and
Apparatus Using Inserted Noise Followed by Noise Reduction, United
States (and International) Patent; Patent No. 6,876,966; Granted on
April 5, 2005
- Apparatus for Speaker Clustering and for
Speech Recognition, Patent No.: 2,965,537; Granted on Aug. 13,
1999; Countries of issue: United States and Japan.
- Apparatus for Speaker Normalization
Processor and for Voice Recognition Device, Patent No.: 2986792;
Granted on Oct. 1, 1999; Countries of issue: United States and
Japan.
- 30 patent applications pending
Downloads
- IPAM05-MSR-VTR-Formants (This database was created by the joint work of MSR and UCLA (IPAM). See our ICASSP2006 paper (contained in the download) for details. Note that this is a 20MB download. We suggest that you save it in your disks before installing it. Note also that this is a database, although it appears as a program when you are running and "installing" it.)
E-mail: deng@NO_SPAM.microsoft.com
U.S.Mail: Microsoft Corporation, One Microsoft Way, Redmond WA,
98052-6399, USA
Tel: (425) 706-2719
Fax: (425) 706-7329 (This is the main MS FAX number so
make sure to send documents to Li Deng's attention) |