Environmental Robustness in Automatic Speech Recognition

In this paper we report our initial efforts to make SPHINXth, e CMU

continuous-speech speaker-independent recognition system, robust to

changes in the environment. To deal with differences in noise level and

spectral tilt between closc-tcking atid desk-top microphones, we propose

two novel methods based on additive corrections in the cepstral domain.

In the first algorithm, the additive correction depends on the instantaneous

SNR of the signal. In the second technique, EM techniques are used to

bes~m atch the cepstral vectors of the input utter.mces to the ensemble of

codebook entries representing a standard acoustical ambience. Use of the

proposed algorithms dramatically improves recognition accuracy when

the system is tested on a microphone other than the one on which it was

trained.

1990-alexac-icassp.pdf
PDF file

In  Proc. of International Conference on Acoustics, Speech, and Signal Processing

Publisher  Institute of Electrical and Electronics Engineers, Inc.
© 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Details

TypeInproceedings
> Publications > Environmental Robustness in Automatic Speech Recognition