|
|
Michael L. Seltzer
Researcher
Speech Technology Group
Research interests
- Speech recognition in adverse environments
- Acoustic modeling
- Microphone array processing
- Machine learning for speech and audio applications
Background
I have been a Researcher in the Speech Technology Group at
Microsoft Research since October, 2003.
I did my graduate work in the department of Electrical
and Computer Engineering at Carnegie Mellon University,
receiving my M.S. in 2000 and my Ph.D. in 2003. While at CMU, I was
a member of the Robust Speech Recognition group,
led by my advisor Professor Richard Stern.
My dissertation research focused on improving recognition accuracy in hands-free environments using microphone
arrays.
From 1996-1998, I worked at Teradyne, Inc. as an Applications
Engineer. Teradyne makes automatic test equipment (ATE) for the semiconductor industry.
I received my B.S. at Brown University
in 1996. At Brown, I worked in the Laboratory for Engineering Man/Machine Systems
(LEMS) with Professor Harvey Silverman on a huge microphone
array project that was called, in fact, the Huge Microphone Array.
Publications
Book Chapters
Journal Publications
- A. Subramanya, M. L. Seltzer, and A. Acero.
Automatic Removal of Typed Keystrokes from Speech Signals,
IEEE Signal Processing Letters, Volume: 14, Issue: 5, May 2007, pp. 363-366.
- M. L. Seltzer and A. Acero.
Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data for Speech Recognition,
in IEEE Trans. on Audio, Speech and Language Processing. Volume: 15 Issue: 1, Jan 2007. pp. 235-245.
- M. L. Seltzer and R. Stern.
Subband Likelihood-Maximizing Beamforming for Speech Recognition in Reverberant Environments,
in IEEE Trans. on Audio, Speech and Language Processing. Volume: 14 Issue: 6, Nov 2006. pp. 2109-2121.
- M. L. Seltzer, B. Raj, and R. Stern.
Likelihood-Maximizing Beamforming for Robust Hands-Free Speech Recognition,
in IEEE Trans. on Speech and Audio Processing. Volume: 12 Issue: 5, Sep 2004. pp. 489-498.
(IEEE SPS Young Author Best Paper Award 2006)
- B. Raj, M. L. Seltzer, and R. Stern.
Reconstruction of Missing Features for Robust Speech Recognition,
in Speech Communication, Elsevier. Volume: 43 Issue: 4, Sep 2004. pp. 275-296.
- M. L. Seltzer, B. Raj, and R. Stern.
A Bayesian Classifier for Spectrographic Mask Estimation for Missing Feature Speech Recognition,
in Speech Communication, Elsevier. Volume: 43 Issue: 4, Sep 2004. pp. 379-393.
- M. L. Seltzer and B. Raj.
Speech recognizer-based filter optimization for microphone array processing,
IEEE Signal Processing Letters, Volume: 10, Issue: 3, March 2003, pp. 69-71.
Conference Publications
- M. L. Seltzer, I. Tashev, and A. Acero,
Microphone Array Post-filter Using Incremental Bayes Learning to Track the Spatial Distributions of Speech and Noise,
in Proc. of ICASSP 2007, Honolulu, HI.
- A. Subramanya, M. L. Seltzer, and A. Acero.
Automatic Removal of Typed Keystrokes from Speech Signals,
in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
- Z. Liu, M. L. Seltzer, A. Acero, I. Tashev, Z. Zhang, and M. Sinclair.
A Compact Multi-Sensor Headset for Hands-Free Communication,
in Proc. of the Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY, USA, Oct. 2005.
- I. Tashev, M. L. Seltzer, and A. Acero.
Microphone Array for Headset with Spatial Noise Suppressor,
in Proc. of the Ninth Int. Workshop on Acoustic, Echo and Noise Control (IWAENC). Eindhoven, The Netherlands, Sep. 2005.
- M. L. Seltzer, A. Acero, and J. Droppo.
Robust Bandwidth Extension of Noise-corrupted Narrowband Speech,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- M. L. Seltzer and A. Acero.
An EM Algorithm for Training Wideband Acoustic Models from Mixed-Bandwidth Training Data,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Puerto Rico, Dec, 2005.
- M. L. Seltzer and A. Acero,
Training Wideband Acoustic Models Using Mixed-Bandwidth Training Data via Feature Bandwidth Extension,
in Proc. of ICASSP 2005, Philadelphia, PA.
- M. L. Seltzer and R. Stern,
Parameter Sharing in Subband Likelihood-Maximizing Beamforming for Speech Recognition Using Microphone Arrays,
in Proc. of ICASSP 2004, Montreal, Canada.
- M. L. Seltzer, J. Droppo, and A. Acero,
A Harmonic-Model-Based Front End for Robust Speech Recognition,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
-
M. L. Seltzer and R. M. Stern,
Subband parameter optimization of microphone arrays for speech recognition in reverberant environments,
in Proc. of ICASSP 2003, Hong Kong.
-
M. L. Seltzer, B. Raj, and R. M. Stern,
Speech recognizer-based microphone array processing for robust hands-free speech recognition,
in Proc. of ICASSP 2002, Orlando, FL.
-
M. L. Seltzer and B. Raj,
Calibration of microphone arrays for improved speech recognition,
in Proc. of Eurospeech 2001, Aalborg, Denmark.
- B. Raj, M. L. Seltzer, and R. M. Stern,
Robust speech recognition using missing features,
in Proc. of the Workshop on Consistent and Reliable Cues (CRAC) for Sound Analysis 2001, Aalborg, Denmark.
- R. Singh, M. L. Seltzer, B. Raj, and R. M. Stern,
Speech in noisy environments: robust automatic segmentation, feature extraction, and hypothesis combination,
in Proc. of ICASSP 2001, Salt Lake City, UT.
- M. L. Seltzer, B. Raj, and R. M. Stern,
Classifier-based mask estimation for missing feature methods of robust speech recognition,
in Proc. ICSLP 2000, Beijing, China.
- B. Raj, M. L. Seltzer, and R. M. Stern,
Reconstruction of damaged spectrographic features for robust speech recognition,
in Proc. of ICSLP 2000, Beijing, China.
Theses
- M. L. Seltzer,
Microphone Array Processing for Robust Speech Recognition,
Ph.D. Thesis, Department of Electrical and Computer Engineering, Carnegie Mellon University, July, 2003.
- M. L. Seltzer,
Automatic Detection of Corrupted Speech Features for Robust Speech Recognition,
Master's Thesis, Department of Electrical and Computer Engineering, Carnegie Mellon University, May, 2000.
E-mail: mseltzer --at-- microsoft --dot-- com
Postal Mail: Microsoft Corporation, One Microsoft Way, Redmond WA,
98052-6399, USA
Tel: (425) 706-3763
Fax: (425) 706-7329 (This is the main MS FAX number so make sure to send
documents to my attention) |
Last updated: April 20, 2005