Yu Zhang, Jian Xu, Zhi-Jie Yan, and Qiang Huo
25 March 2012
Recently, we proposed an i-vector approach to acoustic sniffing for irrelevant variability normalization based acoustic model training in large vocabulary continuous speech recognition (LVCSR). Its effectiveness has been confirmed by experimental results on Switchboard-1 conversational telephone speech transcription task. In this paper, we study several discriminative feature extraction approaches in i-vector space to improve both recognition accuracy and run-time efficiency. New experimental results are reported on a much larger scale LVCSR task with about 2000 hours training data.
In IEEE International Conference on Acoustics, Speech and Signal Processing, 2012, ICASSP 2012
Publisher International Conference on Acoustics, Speech, and Signal Processing (ICASSP)