Yun-Cheng Ju, Michael Seltzer, and Ivan Tashev
September 2009
Speech recognition technology is prone to mistakes, but this is
not the only source of errors that cause speech recognition
systems to fail; sometimes the user simply does not utter the
command correctly. Usually, user mistakes are not considered
when a system is designed and evaluated. This creates a gap
between the claimed accuracy of the system and the actual
accuracy perceived by the users. We address this issue
quantitatively in our in-car infotainment media search task and
propose expanding the capability of voice command to
accommodate user mistakes while retaining a high percentage
of the performance for queries with correct syntax. As a
result, failures caused by user mistakes were reduced by an
absolute 70% at the cost of a drop in accuracy of only 0.28%.
Publisher: International Speech Communication Association
© 2007 ISCA. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the ISCA and/or the author.
| Type: | Inproceedings |