Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Effects of audio and ASR quality on cepstral and high-level speaker verification systems

Andreas Stolcke, Martin Graciarena, and Luciana Ferrer

Abstract

Speech data for NIST speaker recognition evaluations has traditionally been distributed in compressed, telephone quality form, even for microphone data that was originally recorded at higher quality. We evaluate the effect that improved audio quality has for speaker verification performance, using a recently released full-bandwidth version of microphone data from the SRE2010 evaluation. Remarkably, we find substantially improved results even though the underlying speaker recognition models remain based on a telephone-band feature front end. For a cepstral GMM system we show improvements purely from the elimination of lossy (μlaw) coding and more effective noise reduction filtering at the full bandwidth. We also find that higher-level speaker recognition systems can benefit from better ASR quality enabled by the improved audio quality. [...]

Details

Publication typeInproceedings
Published inProceedings Odyssey Speaker and Language Recognition Workshop
PublisherInternational Speech Communication Association
> Publications > Effects of audio and ASR quality on cepstral and high-level speaker verification systems