|
|
Geoffrey
Zweig
Senior Researcher
Speech Research Group
Research Interests
- Voice Search: Getting information on demand over the cellphone
- Speech Analytics: Understanding the gist of conversations and statements
- Speech Recognition Infrastructure: Trainers and Decoders
- Machine Learning Methods for Automatic Speech Recognition: Boosting, Bayesian Networks
- Dialog Systems: Talking to computers, gracefully coping with errors
Background
I am a speech recognition researcher at
Microsoft Research, which I joined in 2006.
My research interests include applied and scientific areas. On the applied
side, I am interested Voice Search and speech interfaces for mobile devices,
especially for accessing business and product information. Live Search for Windows Mobile
and Voice-Rate are two applications in this area. On the scientific side, I am interested
in improved algorithms for acoustic modeling and decoding, phonetic decoding, and multi-lingual
robustness.
Prior to joining Microsoft, I worked at
IBM Research for eight years, again
focusing on speech research, and most recently working on English, Arabic and
Mandarin speech recognition systems for the DARPA EARS (Effective Affordable
Reusable Speech-to-Text) and GALE (Global Autonomous Language Exploitation)
programs.
I received my PhD in 1998 from the Computer Science Department of the
University of California at Berkeley
where I was advised by Stuart Russell and Nelson Morgan.
Publications
- G. Zweig, D. Bohus, X. Li and P. Nguyen.
Structured Models for Joint Decoding of Repeated Utterances.
In Proceedings of Interspeech. 2008.
- D. Bohus, X. Li, P. Nguyen and G. Zweig.
Learning N-Best Correction Models from Implicit User Feedback in a Multi-Modal Local Search Application.
In Proceedings of SIGdial. 2008.
- Z. Li, P. Nguyen and G. Zweig.
Optimal Dialog in Consumer-Rating Systems using a POMDP Framework.
In Proceedings of SIGdial. 2008.
- G. Zweig and J. Nedel.
Empirical Properties of Multilingual Phone-to-Word Transduction.
In Proceedings of ICASSP. 2008.
- C. White, G. Zweig, L. Burget, P. Schwarz, H. Hermansky.
Confidence Estimation, OOV Detection and Language ID Using Phone-to-Word Transduction and Phone-Level Alignments.
In Proceedings of ICASSP. 2008.
- G. Choueiter, G. Zweig and P. Nguyen.
An Empirical Study of Automatic Accent Classification.
In Proceedings of ICASSP. 2008.
- A. Acero, N. Bernstein, R. Chambers, Y. C. Ju, X. Li, J. Odell, P. Nguyen, O. Scholz and G. Zweig.
Live Search for Mobile: Web Services by Voice on the Cellphone.
In Proceedings of ICASSP. 2008.
- X. Li, Y. C. Ju, G. Zweig and A. Acero.
Language Modeling for Voice Search: A Machine Translation Approach.
In Proceedings of ICASSP. 2008.
- G. Zweig, P. Nguyen, Y. C. Ju, Ye-Yi Wang, D. Yu and A. Acero.
The Voice-Rate Dialog System for Consumer Ratings.
In Proceedings of Interspeech. 2007.
- D. Yu, Y. C. Ju, Ye-Yi Wang, G. Zweig and A. Acero.
Automated Directory Assistance System - From Theory to Practice.
In Proceedings of Interspeech. 2007.
- H. Kuo, G. Zweig, and B. Kingsbury.
Discriminative Training of Decoding Graphs for Large Vocabulary Continuous Speech Recognition.
In Proceedings of ICASSP. 2007.
- H. Soltau, G. Saon, D. Povey, L. Mangu, J. Kuo, M. Omar and G. Zweig.
The IBM 2006 GALE Arabic ASR System.
In Proceedings of ICASSP. 2007.
- G. Zweig, O. Siohan, B. Ramabhadran, D. Povey, L. Mangu and B. Kingsbury.
Automated Quality Monitoring in the Call Center with ASR and Maximum Entropy.
In Proceedings of ICASSP . 2006.
- G. Choueiter, D. Povey, S. Chen, and G. Zweig.
Morpheme Based Language Modeling for Arabic LVCSR.
In Proceedings of ICASSP . 2006.
- S. Chen, B. Kingsbury, L. Mangu, D. Povey, G. Saon, H. Soltau and G. Zweig.
Advances in Speech Transcription at IBM under the DARPA EARS Program.
In IEEE Transactions on Audio, Speech and Language Processing. Vol. 14, No. 5. 2006
- B. Ramabhadran, O. Siohan, L. Mangu, G. Zweig, M. Westphal, H. Schulz and A. Soneiro.
The IBM 2006 Speech Transcription System for European Parliamentary Speeches.
In Proceedings of ICSLP 2006. 2006.
- Y. Qin, Q. Shi, Y.Y. Liu, H. Aronowitz, S. Chu, H. Kuo and G. Zweig.
Advances in Mandarin Broadcast Speech Transcription at IBM under the GALE Program.
In Lecture Notes in Computer Science . Springer Verlag v. 4274, pp. 410-421. 2006.
- H. Soltau, B. Kingsbury, L. Mangu, D. Povey, G. Saon and G. Zweig.
The IBM 2004 Conversational Telephony System for Rich Transcription .
In Proceedings of ICASSP 2005.
- D. Povey, B. Kingsbury, L. Mangu, G. Saon, H. Soltau and G. Zweig.
FMPE: Discriminatively Trained Features for Speech Recognition.
In Proceedings of ICASSP 2005.
- G. Saon, G. Zweig and D. Povey.
Anatomy of an Extremely Fast LVCSR Decoder.
In Proceedings of Interspeech. 2005.
- O. Siohan, B. Ramabhadran, G. Zweig.
Speech Recognition Error Analysis on the English MALACH Corpus.
In Proceedings of ICSLP. 2004.
- B. Ramabhadran, G. Zweig, O. Siohan.
Use of metadata to improve recognition of spontaneous speech and named entities.
In Proceedings of ICSLP. 2004.
- F. Yvon, G. Zweig and G. Saon.
Arc Minimization in Finite State Decoding Graphs with Cross-Word Decoding Context.
In Computer Speech and Language. Vol. 18, 2004.
- G. Zweig and M. Picheny.
Advances in Large Vocabulary Speech Recognition.
In Advances in Computers, Elsevier Science. 2004.
- G. Zweig.
Bayesian Network Structures and Inference Techniques for Automatic Speech Recognition.
In Computer Speech and Language 2003.
- G. Saon, G. Zweig, B. Kingsbury, L. Mangu and U. Chaudhari.
An Architecture for Rapid Decoding of Large Vocabulary Conversational Speech.
In Proceedings of Eurospeech. 2003.
- EE Jan, B. Maison, L. Mangu and G. Zweig.
Automatic construction of Unique Signatures and Confusable sets
for Natural Language Directory Assistance Application.
In Proceedings of Eurospeech. 2003.
- B. Kingsbury, L. Mangu, G. Saon, G. Zweig,
S. Axelrod, V. Goel, K. Visweswariah, and M. Picheny.
Toward Domain-Independent Conversational Speech Recognition.
In Proceedings of Eurospeech 2003.
- G. Zweig, G. Saon, and F. Yvon.
Arc Minimization in Finite State Decoding Graphs with
Cross-Word Decoding Context.
In Proceedings of ICSLP 2002.
- J. Huang, G. Zweig, and M. Padmanabhan.
Information Extraction from Voicemail.
In Proceedings of ACL 2002.
- J. Huang and G. Zweig.
Maximum Entropy Modeling for Punctuation from Speech.
In Proceedings of ICSLP 2002.
- G. Zweig, J. Bilmes, et al.
Structurally Discriminative
Graphical Models for Automatic Speech Recognition: Results from
the 2001 Johns Hopkins Summer Workshop.
In Proceedings of ICASSP 2002.
- J. Bilmes and G. Zweig.
The Graphical Models Toolkit:
An Open Source Software System for Speech and Time Series Processing.
In Proceedings of ICASSP 2002.
- G. Zweig.
Bayesian Network Structures and Inference Techniques for Automatic Speech Recognition.
Unpublished Tutorial, 2001.
- G. Zweig.
Hidden Variable Structures for Training and Decoding with Graphical Models in ASR.
Johns Hopkins Workshop Presentation, 2001.
- G. Saon, G. Zweig, and M Padmanabhan.
Linear Feature Space Transformations for Speaker Adaptation.
In Proceedings of ICASSP 2001.
- G. Zweig, J. Huang, and M. Padmanabhan.
Extracting Caller Information from Voicemail.
In Proceedings of Eurospeech 2001.
- M. Padmanabhan, G. Saon, G. Zweig, J. Huang, B. Kingsbury, L. Mangu.
Evolution of the Performance of Automatic Speech
Recognition Algorithms in Transcribing Conversational Telephone Speech.
In Proceedings of IMTC 2001.
- G. Zweig and M. Padmanabhan.
Exact Alpha-Beta Computation in Logarithmic Space with
Application to MAP Word Graph Construction.
In Proceedings of ICSLP 2000.
- G. Zweig and M. Padmanabhan.
Boosting Gaussian Mixtures in an LVCSR System.
In Proceedings of ICASSP 2000.
- M. Padmanabhan, G. Saon, and G. Zweig.
Lattice-Based Unsupervised MLLR for Speaker Adaptation.
In Proceedings of ASR 2000.
- J. Huang, B. Kingsbury, L. Mangu, M. Padmanabhan,
G. Saon, and G. Zweig.
Recent Improvements in Speech
Recognition Performance on Large Vocabulary Conversational
Speech (Voicemail and Switchboard).
In Proceedings of ICSLP 2000.
- Geoffrey Zweig and Mukund Padmanabhan.
Dependency Modeling with Bayesian Networks in a
Voicemail Transcription System.
In Proceedings of Eurospeech 1999.
- M. Padmanabhan, G. Saon, S. Basu, J. Huang, G. Zweig.
Recent improvements on a VoiceMail transcription Task.
In Proceedings of Eurospeech 1999.
- G. Zweig and S. Russell.
Probabilistic Modeling with Bayesian Networks for Automatic Speech Recognition.
In Australian Journal of Intelligent Information Processing. 1999.
- G. Zweig.
Speech Recognition with Dynamic Bayesian Networks.
PhD Thesis University of California at Berkeley. 1998.
- G. Zweig, S. Russell.
Speech Recognition with Dynamic Bayesian Networks.
In Proceedings of AAAI 1998.
- G. Zweig and S. Russell.
Probabilistic Modeling with Bayesian Networks for ASR.
In Proceddings of ICSLP 1998.
- A. Broder, S. Glassman, M. Manasse and G. Zweig.
Syntactic Clustering of the Web.
In Proceedings of WWW6 and Computer Networks 29 (8-13) and
Digital/HP Technical Report SRC-TN-1997-015
1997. Best Paper Award at WWW6.
- R. Karp, O. Waarts, G. Zweig.
The Bit Vector Intersection Problem.
In Foundations of Computer Science 1995.
- F. Alizadeh, R. Karp, D. Weisser, G. Zweig.
Physical Mapping of Chromosomes Using Unique Probes.
In Symposium on Discrete Algorithms 1994.
- G. Zweig.
An Effective Tour Construction and Improvement Procedure for the
Traveling Salesman Problem. Operations Research , v. 43, n. 6. 1995.
Patents
I am an inventor of several patents, including:
- U.S. Patent #6,842,796: Information Extraction from Documents with Regular Expression Matching (2005),
with M. Padmanabhan.
- U.S. Patent #6,611,678: Device and Method for Trainable Radio Scanning (2003), with C. Neti.
- U.S. Patent #6,411,933: Methods and Apparatus for Correlating Biometric Attributes and Biometric
Attribute Production Features (2002), with S. Maes.
- U.S. Patent #6,119,124: Method for Clustering Closely Resembling Data Objects (2000),
with A. Broder, S. Glassman, M. Manasse, and G. Nelson.
Last updated: July 3, 2008
E-mail: Geoffrey Zweig
U.S.Mail: Microsoft Corporation, One Microsoft Way, Redmond WA, 98052-6399, USA
Tel: (425) 421-6668
Fax: (425) 936-7329 (This is the main MS FAX number so make sure to send documents to my attention) |