Alex Acero and Richard Stern
In this paper we present several algorithms that increase the robustness of
SPHINXth, e CMU continuous-spccch speaker-independent recognition
system, by normalizing the acoustic spacc via minimization of the overall
VQ distortion. We propose an affme transformation of the cepstrum in
which a matrix multiplication performs frequency normalization and a
vector addition attempts environment normalization. The algorithms for
environment normalization are very efficient and they improve dramatically
the recognition accuracy when the system is tested on a microphone
othcr from the one on which it was trained. The frequency normalization
algorithm applies a different warping of the kquency axis to different
speakers and it achieves a 10% decrease in error rate.
|Published in||Proc. of the International Conference on Acoustics, Speech and Signal Processing|
|Publisher||Institute of Electrical and Electronics Engineers, Inc.|
© 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.