Geoffrey Zweig and Patrick Nguyen
2010
This paper describes a new toolkit - SCARF - for doing speech
recognition with segmental conditional random fields. It is designed
to allow for the integration of numerous, possibly redundant
segment level acoustic features, along with a complete
language model, in a coherent speech recognition framework.
SCARF performs a segmental analysis, where each segment corresponds
to a word, thus allowing for the incorporation of acoustic
features defined at the phoneme, multi-phone, syllable and
word level. SCARF is designed to make it especially convenient
to use acoustic detection events as input, such as the detection
of energy bursts, phonemes, or other events. Language modeling
is done by associating each state in the SCRF with a state in
an underlying n-gram language model, and SCARF supports the
joint and discriminative training of language model and acoustic
model parameters. SCARF is available for download from
http://research.microsoft.com/en-us/projects/scarf/
![]() PDF file |
Publisher International Speech Communication Association
© 2007 ISCA. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the ISCA and/or the author.
| Type | Inproceedings |