Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Conversational Speech Transcription Using Context-Dependent Deep Neural Networks

Dong Yu, Frank Seide, and Gang Li

Abstract

Context-Dependent Deep-Neural-Network HMMs, or CD-DNN-HMMs, combine the classic arti?cial-neural-network HMMs with traditional context-dependent acoustic mod- eling and deep-belief-network pre-training. CD-DNN-HMMs greatly outperform conven- tional CD-GMM (Gaussian mixture model) HMMs: The word error rate is reduced by up to one third on the di?cult benchmarking task of speaker-independent single-pass transcription of telephone conversations.

Details

Publication typeInproceedings
Published inICML 2012
> Publications > Conversational Speech Transcription Using Context-Dependent Deep Neural Networks