|
The goal of the Web Search & Data Mining Group of Microsoft Research Asia is to drive the next generation of Web search by leveraging data mining, machine learning, and knowledge discovery techniques for information analysis, organization, retrieval, and visualization. In addition, in contrast with current Web search methods that essentially do document-level ranking and retrieval, the Web Search & Data Mining Group has created search at the object level to bring increased knowledge and intelligence to users.
|
|
The information era has brought us vast amounts of digitized text that are generated, propagated, exchanged, stored, and accessed through the Internet each day across the world. The accumulation of this data is making information acquisition increasingly difficult, with language becoming a critical obstacle to growth. To overcome these difficulties, the Natural Language Computing (NLC) Group is focusing its efforts on a variety of research topics, including multi-language text analysis, machine translation, cross language information retrieval, and question answering. Over the years, the group has made significant contributions to Microsoft products, including a Japanese and Chinese Input Method Editor (IME), English writing assistant for Office 2007, Chinese couplet game for Windows Live, Chinese word breaker, pinyin search and search speller for the MSN search engine, text mining for SQL Servers and SharePoint, and meta data extraction for MSN. Our research achievements have been published at most prestigious NLP conferences, including 21 papers at ACL and eight papers at SIGIR, from 2000-2007. This group was awarded MSRA "stamina award" in 2006 due to the above-mentioned excellent achievements.
|