Robust Speech Recognition by Normalization of the Acoustic Space

In this paper we present several algorithms that increase the robustness of
SPHINX, the CMU continuous-speech speaker-independent recognition
system, by normalizing the acoustic space via minimization of the overall
VQ distortion. We propose an affine transformation of the cepstrum in
which a matrix multiplication performs frequency normalization and a
vector addition attempts environment normalization. The algorithms for
environment normalization are very efficient and they improve dramatically
the recognition accuracy when the system is tested on a microphone
other than the one on which it was trained. The frequency normalization
algorithm applies a different warping of the frequency axis to different
speakers and it achieves a 10% decrease in error rate.
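
As a rough illustration of the idea in the abstract, the sketch below applies an affine map y = A x + b to cepstral vectors and estimates only the additive vector b by lowering the average VQ distortion against a fixed codebook. It is not the paper's algorithm: the alternating nearest-codeword update, the squared Euclidean distortion, and all names (`nearest`, `vq_distortion`, `affine`, `estimate_bias`) are assumptions made for illustration, and the matrix A is simply left as the identity.

```python
def sq_dist(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((ui - vi) ** 2 for ui, vi in zip(u, v))


def nearest(codebook, v):
    """Return the codeword closest to v (assumed distortion: squared Euclidean)."""
    return min(codebook, key=lambda c: sq_dist(c, v))


def vq_distortion(codebook, frames):
    """Average distortion of the frames against their nearest codewords."""
    return sum(sq_dist(nearest(codebook, v), v) for v in frames) / len(frames)


def affine(frames, A, b):
    """Apply y = A x + b to every frame (A as a list of rows)."""
    return [
        [sum(a * x for a, x in zip(row, v)) + bi for row, bi in zip(A, b)]
        for v in frames
    ]


def estimate_bias(codebook, frames, iterations=10):
    """Hypothetical estimator for b: alternate between assigning shifted
    frames to codewords and setting b to the mean (codeword - frame)."""
    dim = len(frames[0])
    b = [0.0] * dim
    for _ in range(iterations):
        shifted = [[x + bi for x, bi in zip(v, b)] for v in frames]
        targets = [nearest(codebook, s) for s in shifted]
        b = [
            sum(t[d] - v[d] for t, v in zip(targets, frames)) / len(frames)
            for d in range(dim)
        ]
    return b


# Toy data: codewords observed through a constant offset of (-1.0, -0.5).
codebook = [[0.0, 0.0], [4.0, 4.0]]
frames = [[-1.0, -0.5], [3.0, 3.5], [-1.0, -0.5]]
identity = [[1.0, 0.0], [0.0, 1.0]]

b = estimate_bias(codebook, frames)
before = vq_distortion(codebook, frames)
after = vq_distortion(codebook, affine(frames, identity, b))
```

In this toy case the estimated vector undoes the constant offset, so the distortion after normalization is lower than before. A frequency-normalizing matrix A would need its own estimation procedure, which this sketch does not attempt.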


In  Proc. of the International Conference on Acoustics, Speech and Signal Processing

Publisher  Institute of Electrical and Electronics Engineers, Inc.
© 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

