Alex Acero and Richard Stern
In this paper we present several algorithms that increase the robustness of SPHINXth, e CMU continuous-spccch speaker-independent recognition system, by normalizing the acoustic spacc via minimization of the overall VQ distortion. We propose an affme transformation of the cepstrum in which a matrix multiplication performs frequency normalization and a vector addition attempts environment normalization. The algorithms for environment normalization are very efficient and they improve dramatically the recognition accuracy when the system is tested on a microphone othcr from the one on which it was trained. The frequency normalization algorithm applies a different warping of the kquency axis to different speakers and it achieves a 10% decrease in error rate.
|Published in||Proc. of the International Conference on Acoustics, Speech and Signal Processing|
|Publisher||Institute of Electrical and Electronics Engineers, Inc.|
© 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.