Conversational Speech Transcription Using Context-Dependent Deep Neural Networks

Context-Dependent Deep-Neural-Network HMMs, or CD-DNN-HMMs, combine the classic arti?cial-neural-network HMMs with traditional context-dependent acoustic mod- eling and deep-belief-network pre-training. CD-DNN-HMMs greatly outperform conven- tional CD-GMM (Gaussian mixture model) HMMs: The word error rate is reduced by up to one third on the di?cult benchmarking task of speaker-independent single-pass transcription of telephone conversations.

CD-DNN-HMM-ICML2012-Invited.pdf
PDF file

In  ICML 2012

Details

TypeInproceedings
> Publications > Conversational Speech Transcription Using Context-Dependent Deep Neural Networks