Web N-gram Services
Access petabytes of data via the Web N-gram services (Public Beta). We invite the whole community to use the Web N-gram services, which are available through a cloud-based platform. The services are intended to drive discovery and innovation in web search, natural language processing, speech, and related areas by supporting research on real-world, web-scale data. Regular data updates also make them suitable for projects that benefit from dynamic data.
Publications
- Kuansan Wang, Christopher Thrasher, Evelyne Viegas, Xiaolong Li, and Paul Hsu, An Overview of Microsoft Web N-gram Corpus and Applications, June 2010
- Jian Huang, Jianfeng Gao, Jiangbo Miao, Xiaolong Li, Kuansan Wang, and Fritz Behr, Exploring Web Scale Language Models for Search Query Processing, in Proceedings of the 19th International World Wide Web Conference (WWW’2010), Raleigh, NC, April 2010

