Milind Mahajan
Senior Researcher
Speech Research Group
Milind Mahajan is a senior researcher in the Speech Research group in
Microsoft Research at Redmond.
- Enhanced audio/video experience
- Multi-modal user interfaces
- Speech Recognition: discriminative acoustic modeling, language modeling.
- Spoken Language Systems: natural language and speech understanding systems.
- Machine Learning: conditional random fields, maximum entropy, perceptron, HMM.
- P. Nguyen, M. Mahajan, and X. He
Training Non-Parametric Features for Statistical Machine Translation.
In ACL 2007 Second Workshop on Statistical Machine Translation. Prague, Czech Republic, June, 2007.
- C. Ma, P. Nguyen, and M. Mahajan
Finding Speaker Identities with a Conditional Maximum Entropy Model.
In Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Honolulu, Hawaii, April, 2007.
- P. Nguyen and M. Mahajan
Audio/Video Navigation with A/V X-Ray.
Demonstration in IEEE/ACL 2006 Workshop on Spoken Language Technology. Palm Beach, Aruba, December, 2006.
- Y. Wang, A. Acero, M. Mahajan and J. Lee
Combining Statistical and Knowledge-based Spoken Language Understanding in Conditional Models.
In Proceedings of COLING/ACL. July 2006.
- M. Mahajan, A. Gunawardana, and A. Acero.
Training Algorithms for Hidden Conditional Random Fields.
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Toulouse, France, May, 2006.
- P. Hsu, M. Mahajan and A. Acero.
Multimodal Text Entry on Mobile Devices,
Demo shown at Automatic Speech Recognition & Understanding (ASRU) Workshop.
San Juan, Puerto Rico, Dec, 2005.
- J. Droppo, M. Mahajan, A. Gunawardana, and A. Acero.
How to Train a Discriminative Front End with Stochastic Gradient Descent and Maximum Mutual Information,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Puerto Rico, Dec, 2005.
- A. Gunawardana, M. Mahajan, A. Acero, and J. Platt.
Hidden Conditional Random Fields for Phone Classification,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- M. Mahajan, A. Gunawardana and A. Acero.
Phone Classification using Hidden Conditional Random Fields, 
Poster, in Snowbird Machine Learning Workshop, May 2005.
- D. Yu, M. Mahajan, P. Mau, and A. Acero.
Maximum Entropy Based Generic Filter for Language Model Adaptation
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Philadelphia, Mar, 2005.
- L. Deng, Y. Wang, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, D. Jacoby, M. Mahajan, C. Chelba, and X. Huang.
Speech and language processing for multimodal human-computer interaction (invited),
in Journal of VLSI Signal Processing Systems (Special issue on Real-World Speech Processing),
Vol. 36, No. 2, February 2004, pp. 161-187.
- Y. Deng, M. Mahajan and A. Acero.
Estimating Speech Recognition Error Rate without Acoustic Test Data,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- D. Yu, K. Wang, M. Mahajan, P. Mau and A. Acero.
Improved Name Recognition With User Modeling,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- C. Chelba, M. Mahajan and A. Acero.
Speech Utterance Classification,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Hong Kong, Apr, 2003.
- C. Chelba and M. Mahajan.
Information Extraction Using the Structured Language Model,
in Proc. of the Int. Conf. on Empirical Methods in Natural Language Processing. Pittsburgh, PA, June 2001.
- L. Deng, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, Y. Wang, D. Jacoby, M. Mahajan, C. Chelba, and X. D. Huang.
Distributed Speech Processing in MiPad's Multimodal User Interface,
in IEEE Transactions on Speech and Audio Processing. Volume: 10 Issue: 8 , Nov 2002, pp. 605-619.
- X. Huang, A. Acero, C. Chelba, L. Deng, J. Droppo, D. Duchene, J. Goodman,
H. Hon, D. Jacoby, L. Jiang, R. Loynd, M. Mahajan, P. Mau, S. Meredith, S.
Mughal, S. Neto, M. Plumpe, K. Stery, G. Venolia, K. Wang, Y. Wang.
MIPAD: A Multimodal Interaction Prototype,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- X. Huang, A. Acero, C. Chelba, L. Deng, D. Duchene, J. Goodman, H. Hon, D.
Jacoby, L. Jiang, R. Loynd, M. Mahajan, P. Mau, S. Meredith, S. Mughal, S. Neto,
M. Plumpe, K. Wang, Y. Wang.
MIPAD: A Next Generation PDA Prototype,
in Proc. of the Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- Y. Wang, M. Mahajan and X. Huang.
A Unified Context-Free Grammar And N-Gram Model for Spoken Language Processing,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Istanbul, Turkey, June, 2000.
- M. Mahajan, D. Beeferman and X. Huang.
Improved Topic-Dependent Language Modeling Using Information Retrieval Techniques,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Phoenix, Mar., 1999.
- X. Huang, A. Acero, F. Alleva, M. Hwang, L. Jiang and M. Mahajan.
"From Sphinx-II to Whisper: Making Speech Recognition Usable" in C. Lee,
F. Soong and K. Paliwal eds.
Automatic Speech and Speaker Recognition, Advanced Topics,
Kluwer Academic Publishers, Norwell, MA, 1996.
- X. Huang, A. Acero, F. Alleva, M. Y. Hwang, L. Jiang and M. Mahajan.
"Microsoft Windows Highly Intelligent Speech Recognizer: Whisper"
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Detroit, MI. May 1995.
Co-inventor on 12 issued patents and 10 pending patent applications.
E-mail: milind dot mahajan at microsoft dot com
U.S. Mail: Microsoft Corporation, Microsoft Research, One Microsoft Way, Redmond, WA
98052-6399, USA
Last updated: August 6, 2007