Research interests
- Speech recognition in adverse environments
- Acoustic modeling
- Microphone array processing
- Speech enhancement
- Machine learning for speech and audio applications
Background
I have been a Researcher in the Speech Technology Group at Microsoft Research since October 2003. I did my graduate work in the Department of Electrical and Computer Engineering at Carnegie Mellon University, receiving my M.S. in 2000 and my Ph.D. in 2003. While at CMU, I was a member of the Robust Speech Recognition group, led by my advisor, Professor Richard Stern. My dissertation research focused on improving recognition accuracy in hands-free environments using microphone arrays. From 1996 to 1998, I worked at Teradyne, Inc. as an Applications Engineer. Teradyne makes automatic test equipment (ATE) for the semiconductor industry. I received my B.S. from Brown University in 1996. At Brown, I worked in the Laboratory for Engineering Man/Machine Systems (LEMS) with Professor Harvey Silverman on a huge microphone array project that was called, in fact, the Huge Microphone Array.
Publications
- Li Deng, Jinyu Li, Jui-Ting Huang, Kaisheng Yao, Dong Yu, Frank Seide, Michael Seltzer, Geoff Zweig, Xiaodong He, Jason Williams, Yifan Gong, and Alex Acero, Recent Advances in Deep Learning for Speech Research at Microsoft, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013
- Dong Yu, Mike Seltzer, Jinyu Li, Jui-Ting Huang, and Frank Seide, Feature Learning in Deep Neural Networks - Studies on Speech Recognition, in International Conference on Learning Representations, May 2013
- Jinyu Li, Michael Seltzer, and Yifan Gong, Improvements to VTS Feature Enhancement, IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 2012
- Jinyu Li, Michael L. Seltzer, and Yifan Gong, Efficient VTS Adaptation Using Jacobian Approximation, in Interspeech, 2012
- Dong Yu and Mike Seltzer, Improved Bottleneck Features Using Pretrained Deep Neural Networks, in Interspeech, International Speech Communication Association, August 2011
- Mike Seltzer and Alex Acero, Separating Speaker and Environmental Variability Using Factored Transforms, in Interspeech, International Speech Communication Association, August 2011
- Michael L. Seltzer, Yun-Cheng Ju, Ivan Tashev, Ye-Yi Wang, and Dong Yu, In Car Media Search, in IEEE Signal Processing Magazine, IEEE SPS, June 2011
- Flávio Ribeiro, Dinei Florencio, Cha Zhang, and Michael Seltzer, CROWDMOS: An Approach for Crowdsourcing Mean Opinion Score Studies, in ICASSP, IEEE, May 2011
- Xing Fan, Michael Seltzer, Jasha Droppo, Henrique Malvar, and Alex Acero, Joint Encoding of the Waveform and Speech Recognition Features Using a Transform Codec, in International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., May 2011
- Mike Seltzer and Alex Acero, HMM Adaptation Using Linear Spline Interpolation with Integrated Spline Parameter Training for Robust Speech Recognition, in Interspeech, International Speech Communication Association, September 2010
- Li Deng, Mike Seltzer, Dong Yu, Alex Acero, Abdel-rahman Mohamed, and Geoff Hinton, Binary Coding of Speech Spectrograms Using a Deep Auto-encoder, in Interspeech 2010, International Speech Communication Association, September 2010
- Mike Seltzer, Alex Acero, and Kaustubh Kalgaonkar, Acoustic Model Adaptation via Linear Spline Interpolation for Robust Speech Recognition, in ICASSP, IEEE, March 2010
- Ivan Tashev, Michael L. Seltzer, and Yun-Cheng Ju, Speech and sound for in-car infotainment systems, in Automotive User Interfaces and Interactive Vehicular Applications (AutomotiveUI 2009), Association for Computing Machinery, Inc., Essen, Germany, 22 September 2009
- Ivan Tashev, Michael Seltzer, Yun-Cheng Ju, Ye-Yi Wang, and Alex Acero, Commute UX: Voice Enabled In-car Infotainment System, in Mobile HCI '09: Workshop on Speech in Mobile and Pervasive Environments (SiMPE), Association for Computing Machinery, Inc., Bonn, Germany, 15 September 2009
- Yun-Cheng Ju, Michael Seltzer, and Ivan Tashev, Improving Perceived Accuracy for In-Car Media Search, International Speech Communication Association, September 2009
- Young-In Song, Ye-Yi Wang, Yun-Cheng Ju, Mike Seltzer, Ivan Tashev, and Alex Acero, Voice Search of Structured Media Data, in International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Taipei, Taiwan, April 2009
- Michael L. Seltzer and Lei Zhang, The data deluge: challenges and opportunities of unlimited data in statistical signal processing, in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Taipei, Taiwan, April 2009
- Ozlem Kalinli, Michael L. Seltzer, and Alex Acero, Noise Adaptive Training Using a Vector Taylor Series Approach for Robust Automatic Speech Recognition, in Proceedings of International Conference on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Taipei, Taiwan, April 2009
- Michael Seltzer and Ivan Tashev, A Log-MMSE Adaptive Filter Using a non-Linear Spatial Filter, in Proceedings of International Workshop on Acoustic Echo and Noise Control (IWAENC 2008), Seattle, USA, September 2008
- Jasha Droppo, Michael L. Seltzer, Alex Acero, and Y.-H. Chiu, Towards a non-parametric acoustic model: an acoustic decision tree for observation probability calculation, in Proceedings of Interspeech, International Speech Communication Association, Brisbane, Australia, September 2008
- Ivan Tashev and Michael Seltzer, Data Driven Beamformer Design for Binaural Headset, in Proceedings of International Workshop on Acoustic Echo and Noise Control (IWAENC 2008), Seattle, USA, September 2008
- Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, and Alex Acero, Robust speech recognition using cepstral minimum-mean-square-error noise suppressor, in IEEE Trans. Audio, Speech, and Language Processing, vol. 16, no. 5, Institute of Electrical and Electronics Engineers, Inc., July 2008
- Michael L. Seltzer, Bridging the gap: towards a unified framework for hands-free speech recognition using microphone arrays, in Proceedings of the Workshop on Hands-Free Speech Communication and Microphone Arrays, Institute of Electrical and Electronics Engineers, Inc., Trento, Italy, May 2008
- Ivan Tashev, Jasha Droppo, Michael Seltzer, and Alex Acero, Robust Design of Wideband Loudspeaker Arrays, in Proc. of International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Las Vegas, USA, April 2008
Contact Info
E-mail: mseltzer --at-- microsoft --dot-- com
Postal Mail: Microsoft Corporation, One Microsoft Way, Redmond, WA 98052-6399, USA
Tel: (425) 706-3763
Fax: (425) 706-7329 (This is the main Microsoft fax number, so please mark documents to my attention.)
