Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
A Model-Based Transformational Approach to Robust Speaker Recognition

Remco Teunen, Ben Shahshahani, and Larry Heck

Abstract

A novel statistical modeling and compensation method for robust speaker recognition is presented. The method specifically addresses the degradation in speaker verification performance due to the mismatch in channels (e.g., telephone handsets) between enrollment and testing sessions. In mismatched conditions, the new approach uses speaker-independent channel transformations to synthesize a speaker model that corresponds to the channel of the testing session. Effectively verification is always performed in matched channel conditions. Results on the 1998 NIST Speaker Recognition Evaluation corpus show that the new approach yields performance that matches the best reported results. Specifically, our approach yields similar improvements (19.9% reduction in EER compared to CMN alone) as the HNORM score-based compensation method, but with a fraction of the training time.

Details

Publication typeInproceedings
Published inProceedings of the International Conference on Spoken Language Processing (ICSLP)
PublisherISCA
> Publications > A Model-Based Transformational Approach to Robust Speaker Recognition