Jun Du and Qiang Huo
This paper presents a new feature compensation approach to noisy speech recognition by using high-order vector Taylor series (HOVTS) approximation of an explicit model of environmental distortions. Formulations for maximum likelihood (ML) estimation of noise model parameters and minimum meansquared error (MMSE) estimation of clean speech are derived. Experimental results on Aurora2 database demonstrate that the proposed approach achieves consistently significant improvement in recognition accuracy compared to traditional first-order VTS based feature compensation approach.
|Published in||Proc. of INTERSPEECH 2008|
|Publisher||International Speech Communication Association|
© 2007 ISCA. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the ISCA and/or the author.