|
Speech Technology (Redmond)
Overview
Microsoft Research has a group in Redmond and another in Beijing working together to improve spoken language technologies. Our main goal is to build applications that make computers available everywhere, and work with Speech Platforms Group product group to make this vision a reality. We are interested not only in creating state-of-the-art spoken language components, but also in how these disparate components can come together with other modes of human-computer interaction to form a unified, consistent computing environment. We are pursuing several projects to help us reach our vision of a fully speech-enabled computer. In Redmond we are working on multimodal user interfaces, and that helps us discover real problems that we need to solve to make speech recognition more useful. The first such application we built was MiPad, a personal information manager used with speech and pen. We are also working on several other areas, including noise robustness, acoustic modeling, language modeling, grammar induction, SALT (Speech Enabled Language Tags) and personalization. Flash overview of speech recognition at MSR (click in "Microsoft Research" and "Speech technology"). We have a few videos that illustrate our technology and a list of publications for more detail. For information about the source code that we have made available to the research community check out our downloads page. People
Former researchers (now in Speech Platforms Group)
Projects
In the past, the speech technology group has worked on other projects, which have been successfully completed, and are either in shipping products (through the Speech Platforms product team), or have moved to the product development stage. These include:
|
||||||||||||||||||||||||||||||||||||||||||||||||||