A Cross-language Study on Automatic Speech Disfluency Detection

Wen Wang, Andreas Stolcke, Jiahong Yuan, and Mark Liberman

Abstract

We investigate two systems for automatic disfluency detection on English and Mandarin conversational speech data. The first system combines various lexical and prosodic features in a ConditionalRandomField model for detecting edit disfluencies. The second system combines acoustic and language model scores for detecting filled pauses through constrained speech recognition. We compare the contributions of different knowledge sources to detection performance between these two languages.

Details

Publication typeInproceedings
Published inProc. NAACL
PublisherAssociation for Computational Linguistics
> Publications > A Cross-language Study on Automatic Speech Disfluency Detection