Retrieving a User Language Model from an Unsupervised Document Map

Mikko Kurimo and Krista Lagus

Helsinki University of Technology

 

This work presents a method to automatically retrieve a language model focused on the topic and style of the speech situation at hand. The retrieval is based on a sample text or a first-pass transcription hypothesis of the speech. We use a self-organizing map (SOM) of all the training texts to index and define the different language models by the map nodes. The smoothly organized index enables a fast local search and easy access to models of different topical granularity.

 

Longer abstract as PDF file

 

Return to Machine Learning and User Interface workshop page.