Bridging the gap: towards a unified framework for hands-free speech recognition using microphone arrays

Michael L. Seltzer

Abstract

In this paper we describe two families of algorithms for hands-free speech recognition using microphone arrays. Enhancement-based approaches use a cascade of independent processing blocks to perform speech enhancement followed by speech recognition. We discuss the reasons why this approach may be sub-optimal and motivate the need for a solution that tightly integrates all processing blocks into a common unified framework. This leads to a second family of algorithms, called unified approaches, which consider all processing stages to be components of a single system that operates with the common goal of improved recognition accuracy. We describe several examples of such algorithms that have been shown to outperform more traditional signal-processing-based approaches. In doing so, we hope to convey the benefits of performing hands-free speech recognition in this manner and motivate further research in this area.

Details

Publication type: Inproceedings
Published in: Proceedings of the Workshop on Hands-Free Speech Communication and Microphone Arrays
Address: Trento, Italy
Publisher: Institute of Electrical and Electronics Engineers, Inc.