Toward Domain-Independent Conversational Speech Recognition

Brian Kingsbury; Lidia Mangu; George Saon; Geoffrey Zweig; Scott Axelrod; Vaibhava Goel; Karthik Visweswariah; Michael Picheny

Proceedings of Eurospeech | January 2003

We describe a multi-domain, conversational test set developed for IBM’s Superhuman speech recognition project and our 2002 benchmark system for this task. Through the use of multipass decoding, unsupervised adaptation and combination of hypotheses from systems using diverse feature sets and acoustic models, we achieve a word error rate of 32.0% on data drawn from voicemail messages, two-person conversations and multiple-person meetings.
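
For readers unfamiliar with the metric, word error rate is conventionally defined as the number of word substitutions, deletions and insertions needed to turn the recognizer output into the reference transcript, divided by the number of reference words. The sketch below illustrates that standard definition only. The abstract does not say which scoring tool or text normalization was used, so the function name `word_error_rate` and the example sentences are hypothetical and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative only: whitespace tokenization and no text normalization
    are assumptions, not the paper's scoring procedure.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Made-up example: one substitution ("me" -> "be") and one deletion
# ("tomorrow") against a five-word reference.
wer = word_error_rate("please call me back tomorrow", "please call be back")
```

Under this definition, a word error rate of 32.0% corresponds to roughly one error for every three reference words.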