Bridging the gap: towards a unified framework for hands-free speech recognition using microphone arrays

In this paper, we describe two families of algorithms for hands-free
speech recognition using microphone arrays. Enhancement-based
approaches use a cascade of independent processing blocks to perform
speech enhancement followed by speech recognition. We discuss
the reasons why this approach may be sub-optimal and motivate
the need for a solution that tightly integrates all processing blocks
into a common unified framework. This leads to a second family
of algorithms, called unified approaches, which consider all processing
stages to be components of a single system that operates with
the common goal of improved recognition accuracy. We describe
several examples of such algorithms that have been shown to outperform
more traditional signal-processing-based approaches. In doing
so, we hope to convey the benefits of performing hands-free speech
recognition in this manner and motivate further research in this area.

In  Proceedings of the Workshop on Hands-Free Speech Communication and Microphone Arrays

Publisher  Institute of Electrical and Electronics Engineers, Inc.
© 2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Details

Type  Inproceedings
Address  Trento, Italy