Subspace Gaussian Mixture Models for Speech Recognition

Daniel Povey; et al

Subspace Gaussian Mixture Models for Speech Recognition

Daniel Povey ,
et al

ICASSP | January 2010

Download BibTex

We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the total parameter space. We call this a Subspace Gaussian Mixture Model (SGMM). Globally shared parameters define the subspace. This style of acoustic model allows for a much more compact representation and gives better results than a conventional modeling approach, particularly with smaller amounts of training data.