Geoffrey Zweig

Research Interests

  • Voice Search: Getting information over the cellphone
  • Machine Learning Methods for Automatic Speech Recognition
  • SCARF: Speech Recognition with Segmental Conditional Random Fields
  • Neural Network based Semantic Models


I am a Principal Researcher in the speech group at Microsoft Research, which I joined in 2006. My research interests span both applied and scientific areas. On the applied side, I am interested in Voice Search and speech interfaces for mobile devices, especially for accessing business and product information. Bing Mobile data provides an ideal avenue for this work. On the scientific side, I am interested in improved algorithms for acoustic modeling and decoding, phonetic decoding, and multilingual robustness. An example of this work is the SCARF toolkit, which I developed with Patrick Nguyen for doing speech recognition with Segmental Conditional Random Fields. Check out the SCARF project page. Recently I have begun to explore neural network models that encode semantic information, and some related publications are here. Prior to joining Microsoft, I worked at IBM Research for eight years, focusing on speech research and most recently working on English, Arabic, and Mandarin speech recognition systems for the DARPA EARS (Effective Affordable Reusable Speech-to-Text) and GALE (Global Autonomous Language Exploitation) programs.

I received my PhD in 1998 from the Computer Science Department of the University of California at Berkeley, where I was advised by Stuart Russell and Nelson Morgan. I have served as Associate Editor of the IEEE Transactions on Audio, Speech, and Language Processing and am currently on the editorial board of Computer Speech and Language. I am a member of the ACM and an IEEE Fellow.



I am an inventor of several patents, including:

  • U.S. Patent #8,145,484: Speech Processing with Predictive Language Modeling (2012)
  • U.S. Patent #7,949,526: Voice Aware Demographic Personalization (2011), with Y.C. Ju, A. Acero, and N. Bernstein.
  • U.S. Patent #6,842,796: Information Extraction from Documents with Regular Expression Matching (2005), with M. Padmanabhan.
  • U.S. Patent #6,611,678: Device and Method for Trainable Radio Scanning (2003), with C. Neti.
  • U.S. Patent #6,411,933: Methods and Apparatus for Correlating Biometric Attributes and Biometric Attribute Production Features (2002), with S. Maes.
  • U.S. Patent #6,119,124: Method for Clustering Closely Resembling Data Objects (2000), with A. Broder, S. Glassman, M. Manasse, and G. Nelson.


E-mail: Geoffrey Zweig
U.S. Mail: Microsoft Corporation, One Microsoft Way, Redmond, WA 98052-6399, USA
Tel: (425) 421-6668
Fax: (425) 936-7329 (This is the main Microsoft fax number, so be sure to mark documents to my attention.)