Reformulating the HMM as a trajectory model by imposing explicit relationship between static and dynamic features

A trajectory model, derived from the HMM by imposing explicit relationship between static and dynamic features, is developed and evaluated. The derived model, named “trajectory-HMM”, can alleviate some limitations of the standard HMM, which are i) piece-wise constant statistics within a state and ii) conditional independence assumption of state output probabilities, without increasing the number of model parameters. In this talk, a Viterbi-type training algorithm is also derived. This model was evaluated both in speech recognition and synthesis experiments. In speaker-dependent continuous speech recognition experiments, the trajectory-HMM achieved error reductions over the standard HMM. The experimental results of subjective listening tests shows that introduction of the trajectory-HMM can improve the quality of synthetic speech generated from HMM-based speech synthesis system which we have proposed.

Speaker Details

Heiga Zen received the B.E. degree in computer science and M.E. degree in electrical and computer engineering from Nagoya Institute of Technology, Nagoya, Japan, in 2001 and 2003, respectively. Currently he is a Ph.D student of the Department of Computer Science at the Nagoya Institute of Technology. From June 2004 to May 2005, he was working in IBM T.J.Watson Research Center as an intern/co-op researcher. His research interests include acoustic modeling for automatic speech recognition and text-to-speech synthesis.

Date:
Speakers:
Heiga Zen
Affiliation:
Nagoya Institute of Technology