Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Our research
Content type
+
Downloads (449)
+
Events (427)
 
Groups (147)
+
News (2666)
 
People (739)
 
Projects (1079)
+
Publications (12262)
+
Videos (5513)
Labs
Research areas
Algorithms and theory47205 (792)
Communication and collaboration47188 (1419)
Computational linguistics47189 (489)
Computational sciences47190 (774)
Computer systems and networking47191 (2055)
Computer vision208594 (1100)
Data mining and data management208595 (222)
Economics and computation47192 (312)
Education47193 (791)
Gaming47194 (380)
Graphics and multimedia47195 (1174)
Hardware and devices47196 (1035)
Health and well-being47197 (472)
Human-computer interaction47198 (2204)
Machine learning and intelligence47200 (1752)
Mobile computing208596 (140)
Quantum computing208597 (62)
Search, information retrieval, and knowledge management47199 (1718)
Security and privacy47202 (775)
Social media208598 (110)
Social sciences47203 (825)
Software development, programming principles, tools, and languages47204 (1491)
Speech recognition, synthesis, and dialog systems208599 (171)
Technology for emerging markets208600 (55)
1–25 of 23282
Sort
Show 25 | 50 | 100
1234567Next 
Yuchen Zhang, Xi Chen, Dengyong Zhou, and Michael I. Jordan

Publication details
Date: 1 December 2015
Type: Inproceeding
Publication details
Date: 1 December 2015
Type: Article
Abram Hindle, Christian Bird, Thomas Zimmermann, and Nachiappan Nagappan

Large organizations like Microsoft tend to rely on formal requirements documentation in order to specify and design the software products that they develop. These documents are meant to be tightly coupled with the actual implementation of the features they describe. In this paper we evaluate the value of high-level topic-based requirements traceability and issue report traceability in the version control system, using Latent Dirichlet Allocation (LDA). We evaluate LDA topics on practitioners...

Publication details
Date: 1 December 2015
Type: Article
Publisher: Springer
Yanjie Fu, Yong Ge, Yu Zheng, Yao, Yanchi Liu, Hui Xiong, and Nicholas Jing Yuan

Ranking residential real estates based on investment values can provide decision making support for home buyers and thus plays an important role in estate marketplace. In this paper, we aim to develop methods for ranking estates based on investment values by mining users opinions about estates from online user reviews and offline moving behaviors (e.g., taxi traces, smart card transactions, check-ins). While a variety of features could be extracted from these data, these features are intercorrelated and...

Publication details
Date: 1 December 2015
Type: Inproceeding
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Yu Zheng

The advances in location-acquisition and mobile computing techniques have generated massive spatial trajectory data, which represent the mobility of a diversity of moving objects, such as people, vehicles and animals. Many techniques have been proposed for processing, managing and mining trajectory data in the past decade, fostering a broad range of applications. In this article, we conduct a systematic survey on the major research into trajectory data mining, providing a panorama of the field...

Publication details
Date: 1 September 2015
Type: Article
Publisher: ACM – Association for Computing Machinery
Mohan Yang, bolin ding, surajit chaudhuri, and kaushik chakrabarti

We aim to provide table answers to keyword queries using a knowledge base. For queries referring to multiple entities, like “Washington cities population” and “Mel Gibson movies”, it is better to represent each relevant answer as a table which aggregates a set of entities or joins of entities within the same table scheme or pattern. In this paper, we study how to find highly relevant patterns in a knowledge base for user-given keyword queries to compose table answers. A knowledge base is...

Publication details
Date: 1 August 2015
Type: Inproceeding
Publisher: VLDB – Very Large Data Bases
Badrish Chandramouli, Jonathan Goldstein, Mike Barnett, Robert DeLine, Danyel Fisher, John C. Platt, James F. Terwilliger, and John Wernsing

This paper introduces Trill – a new query processor for analytics. Trill fulfills a combination of three requirements for a query processor to serve the diverse big data analytics space: (1) Query Model: Trill is based on a tempo-relational model that enables it to handle streaming and relational queries with early results, across the latency spectrum from real-time to offline; (2) Fabric and Language Integration : Trill is architected as a high-level language library that supports...

Publication details
Date: 1 August 2015
Type: Inproceeding
Publisher: VLDB – Very Large Data Bases
Software engineering for education focuses on developing technologies that make programming, testing and analysis more accessible to students. This workshop explores testing through gaming, which is popular with students, and can produce data worthy of analysis. Code Hunt is an industrial strength programming game which is now open in the community and available for research.
Event details
Date: 14 July 2015
Location: Baltimore, MD
Type: Workshop
At the the sixteenth annual Microsoft Research Faculty Summit, leading academic researchers and educators will meet with Microsoft researchers and engineers to share ideas and results about some of today’s most exciting new directions in computer science. The key focus of this year’s summit is artificial intelligence (AI) with sessions on questioning and answering, vision to language, commonsense reasoning, and integrative AI.
Event details
Date: 8–9 July 2015
Location: Redmond, WA, US
Type: Conference
Each year, Microsoft Research sponsors a semester-long class at leading design schools. Students are asked to form interdisciplinary teams of two to four students to design a user experience prototype that solves a real-world problem. From these groups, a representative team from each school presents its work to Microsoft. This year, Design Expo is excited to align with Imagine Cup and Microsoft's 40th anniversary. The event will take place from July 28th to 31st in Redmond, Washington.
Event details
Date: 8–9 July 2015
Location: Microsoft Redmond
Type: Conference
This annual event is for PhD students in their first or second year from universities and research institutions with which Microsoft Research partners, as well as all Microsoft Research PhD Scholars. It includes a series of talks of academic interest, transferable skills talks, and poster sessions that provide invited students the opportunity to present their work to Microsoft researchers.
Event details
Date: 29 June–3 July 2015
Location: Cambridge, UK
Type: Other
The goal of this workshop is to foster the communication of communities broadly working in the area of data science, with a particular focus of stimulating increased interactions between statisticians, computer scientists, and domain experts in order to ambitiously attack important scientific problems involving big and complex data.
Event details
Date: 10–11 June 2015
Location: Cambridge, Mass.
Type: Workshop
Neeraj Kayal and Chandan Saha

Shpilka and Wigderson [SW99] had posed the problem of proving exponential lower bounds for (nonhomogeneous) depth three arithmetic circuits with bounded bottom fanin over a field F of characteristic zero. We resolve this problem by proving a NOmega(d/t) lower bound for (nonhomogeneous) depth three arithmetic circuits with bottom fanin at most t computing an explicit N-variate polynomial of degree d over F.

Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: LIPICS
Alessandro Sordoni, Michel Galley, Michael Auli, Chris Brockett, Yangfeng Ji, Meg Mitchell, Jian-Yun Nie, and Bill Dolan
Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: Conference of the North American Chapter of the Association for Computational Linguistics – Human Language Technologies (NAACL-HLT 2015)
Fuzheng Zhang, Nicholas Jing Yuan, David Wilkie, Yu Zheng, and Xing Xie

Urban transportation is an important factor in energy consumption and pollution, and is of increasing concern due to its complexity and economic significance. Its importance will only increase as urbanization continues around the world. In this paper, we explore drivers’ refueling behavior in urban areas. Compared to questionnaire-based methods of the past, we propose a complete data-driven system that pushes towards real-time sensing of individual refueling behavior and citywide petrol consumption. Our...

Publication details
Date: 1 June 2015
Type: Article
Publisher: ACM – Association for Computing Machinery
Pantazis Deligiannis, Jeroen Ketema, Paul Thomson, Alastair Donaldson, and Akash Lal

Programming efficient asynchronous systems is challenging because it can often be hard to express the design declaratively, or to defend against interleaving-dependent bugs such as data races and other assertion violations. Previous work has only addressed these challenges individually, either by designing a new declarative language, or a new data race detection tool, or a new testing technique. We present P#, a language for high-reliability asynchronous programming co-designed with a static analysis...

Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: ACM
Shuo Ma, Yu Zheng, and Ouri Wolfson

We proposed and developed a taxi-sharing system that accepts taxi passengers’ real-time ride requests sent from smartphones and schedules proper taxis to pick up them via ridesharing, subject to time, capacity, and monetary constraints. The monetary constraints provide incentives for both passengers and taxi drivers: passengers will not pay more compared with no ridesharing and get compensated if their travel time is lengthened due to ridesharing; taxi drivers will make money for all the detour distance...

Publication details
Date: 1 June 2015
Type: Article
Publisher: IEEE
Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John Platt, Lawrence Zitnick, and Geoffrey Zweig

This paper presents a novel approach for automatically generating image descriptions: visual detectors and language models learn directly from a dataset of image captions.We use Multiple Instance Learning to train visual detectors for words that commonly occur in captions, including many different parts of speech such as nouns, verbs, and adjectives. The word detector outputs serve as conditional inputs to a maximum-entropy language model. The language model learns from a set of over 400,000 image...

Publication details
Date: 1 June 2015
Type: Article
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Akash Lal and Shaz Qadeer

A hierarchical program is one with multiple procedures but no loops or recursion. This paper studies the problem of deciding reachability queries in hierarchical programs. This problem is fundamental to verification and most directly applicable to doing bounded reachability in programs, i.e., reachability under a bound on the number of loop iterations and recursive calls.

The usual method of deciding reachability in hierarchical programs is to first
inline all procedures and then do...

Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: ACM
Jiansong Zhang, Jin Zhang, Kun Tan, Lin Yang, Yongguang Zhang, and Qian Zhang
Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: ACM Mobihoc 2015
Jialu Liu, Jingbo Shang, Chi Wang, Xiang Ren, and Jiawei Han

Text data are ubiquitous and play an essential role in big data applications. However, text data are mostly unstructured. Transforming unstructured text into structured units (e.g., semantically meaningful phrases) will substantially reduce semantic ambiguity and enhance the power and efficiency at manipulating such data using database technology. Thus mining quality phrases is a critical research problem in the field of databases. In this paper, we propose a new framework that extracts quality...

Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Philip A. Bernstein, Sudipto Das, Bailu Ding, and Markus Pilman

Scaling-out a database system typically requires partitioning the database across multiple servers. If applications do not partition perfectly, then transactions accessing multiple partitions end up being distributed, which has well-known scalability challenges. To address them, we describe a high-performance transaction mechanism that uses optimistic concurrency control on a multi-versioned tree-structured database stored in a shared log. The system scales out by adding servers, without partitioning...

Publication details
Date: 31 May 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
The third annual New England Machine Learning Day will be held May 13th, 2014, at Microsoft Research New England, One Memorial Drive, Cambridge, MA 02142. The event will bring together local academics and researchers in machine learning and its applications.
Event details
Date: 18 May 2015
Location: Microsoft Research New England
Type: Workshop
Kuansan Wang

Human is the only species on earth that has mastered the technologies in writing and printing to capture ephemeral thoughts and scientific discoveries. The capabilities to pass along knowledge, not only geographically but also generationally, have formed the bedrock of our civilizations. We are in the midst of a silent revolution driven by the technological advancements: no longer are computers just a fixture of our physical world but have they been so deeply woven into our daily routines that they are...

Publication details
Date: 18 May 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Elad Yom-Tov

Syndromic surveillance refers to the analysis of medical information for the purpose of detecting outbreaks of disease earlier than would have been possible otherwise and to estimate the prevalence of the disease in a population. Internet data, especially search engine queries and social media postings, have shown promise in contributing to syndromic surveillance for in uenza and dengue fever. Here we focus on the recent outbreak of Ebola Virus Disease and ask whether three major sources of Internet...

Publication details
Date: 18 May 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
1–25 of 23282
Sort
Show 25 | 50 | 100
1234567Next 
> Our research