Source-Filter Models for Time-Scale Pitch-Scale Modification of Speech

Alex Acero

Source-Filter Models for Time-Scale Pitch-Scale Modification of Speech

Alex Acero

Proceedings of the International Conference on Acoustics, Speech, and Signal Processing | May 1998

Download BibTex

This paper presents two time-scale pitch-scale modification techniques to be used in speech synthesis systems. They have been applied to Microsoft’s Whistler system, which is based on concatenative synthesis. Both methods are based on a source-filter model, one of them using LPC parameters and the other one using cepstral parameters. The proposed methods achieve high quality prosody modification, retain the characteristics of the donor speaker, allow for spectral manipulation (to reduce spectral discontinuities at unit boundaries), yield compact acoustic inventories and improved voiced fricatives.