Jasha Droppo and Alex Acero
An accurate model for the structure of speech is essential to many speech processing applications, including speech en-hancement, synthesis, recognition, and coding. This paper ex-plores some deficiencies of standard harmonic methods of mod-eling voiced speech. In particular, they ignore the effect of fun-damental frequency changing within an analysis frame, and the fact that the fundamental frequency is not a continuously vary-ing parameter, but a side effect of a series of discrete events. We present an alternative, time-series based framework for modeling the voicing structure of speech called the fine pitch model. By precisely modeling the voicing structure, it can more accurately account for the content in a voiced speech segment. Index Terms: speech analysis, pitch estimation, fundamental
|Published in||Proc. Interspeech Conference|
|Publisher||International Speech Communication Association|
© 2007 ISCA. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the ISCA and/or the author.