Minimum Word Classification Error Training of HMMs for Automatic Speech Recognition

Zhijie Yan; Bo Zhu; Yu Hu; Ren-Hua Wang

Minimum Word Classification Error Training of HMMs for Automatic Speech Recognition

Zhijie Yan ,
Bo Zhu ,
Yu Hu ,
Ren-Hua Wang

IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, ICASSP 2008 | May 2008

Published by IEEE

Publication

Download BibTex

This paper presents a novel discriminative training criterion, minimum word classification error (MWCE). By localizing conventional string-level MCE loss function to word-level, a more direct measure of empirical word classification error is approximated and minimized. Because the word-level criterion better matches performance evaluation criteria such as WER, an improved word recognition performance can be achieved. We evaluated and compared MWCE criterion in a unified DT framework, with other commonly-used criteria including MCE, MMI, MWE, and MPE. Experiments on TIMIT and WS JO evaluation tasks suggest that word-level MWCE criterion can achieve consistently better results than string-level MCE. MWCE even outperforms other substring-level criteria on the above two tasks, including MWE and MPE.

© 2008 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.http://www.ieee.org/