Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Creating Speech Recognition Grammars from Regular Expressions for Alphanumeric Concepts

Ye-Yi Wang and Yun-Cheng Ju

Abstract

To bring speech recognition mainstream, researchers have been working on automatic grammar development tools. Most of the work focused on the modeling of sentence level commands for mixed-initiative dialogs. In this paper we describe a novel approach that enables the developers with little grammar authoring experience to construct high performance speech grammars for alphanumeric concepts, which are often needed in the more commonly used directed dialog systems in practice. A developer can simply write down a regular expression for the concept and the algorithm automatically constructs a W3C grammar with appropriate semantic interpretation tags. While the quality of the grammar is ultimately determined by the way in which the regular expression is written, the algorithm relieves the developers from the difficult tasks of optimizing grammar structures and assigning appropriate semantic interpretation tags, thus it greatly speeds up grammar development and reduces the requirement of expertise. Preliminary experimental results have shown that the grammar created with this approach consistently outperformed the general alphanumeric rules in the grammar library. In some cases the semantic error rates were cut by more than 50%.

Details

Publication typeInproceedings
Published inInternational Conference on Spoken Language Processing
Pages2161–2164
AddressJeju, Korea
PublisherInternational Speech Communication Association
> Publications > Creating Speech Recognition Grammars from Regular Expressions for Alphanumeric Concepts