Our research
Nicholas Jing Yuan, Yu Zheng, Xing Xie, Yingzi Wang, Kai Zheng, and Hui Xiong

The step of urbanization and modern civilization fosters different functional zones in a city, such as residential areas, business districts, and educational areas. In a metropolis, people commute between these functional zones every day to engage in different socioeconomic activities, e.g., working, shopping, and entertaining. In this paper, we propose a data-driven framework to discover functional zones in a city. Specifically, we introduce the concept of Latent Activity Trajectory (LAT), which...

Publication details
Date: 1 August 2016
Type: Article
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Abram Hindle, Christian Bird, Thomas Zimmermann, and Nachiappan Nagappan

Large organizations like Microsoft tend to rely on formal requirements documentation in order to specify and design the software products that they develop. These documents are meant to be tightly coupled with the actual implementation of the features they describe. In this paper we evaluate the value of high-level topic-based requirements traceability and issue report traceability in the version control system, using Latent Dirichlet Allocation (LDA). We evaluate LDA topics on practitioners...

Publication details
Date: 1 December 2015
Type: Article
Publisher: Springer
Emerson Murphy-Hill, Thomas Zimmermann, Christian Bird, and Nachiappan Nagappan

When software engineers fix bugs, they may have several options as to how to fix those bugs. Which fix they choose has many implications, both for practitioners and researchers: What is the risk of introducing other bugs during the fix? Is the bug fix in the same code that caused the bug? Is the change fixing the cause or just covering a symptom? In this paper, we investigate alternative fixes to bugs and present an empirical study of how engineers make design choices about how to fix bugs. We start...

Publication details
Date: 1 December 2015
Type: Article
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Yanjie Fu, Yong Ge, Yu Zheng, Yao, Yanchi Liu, Hui Xiong, and Nicholas Jing Yuan

Ranking residential real estates based on investment values can provide decision-making support for home buyers and thus plays an important role in the estate marketplace. In this paper, we aim to develop methods for ranking estates based on investment values by mining users' opinions about estates from online user reviews and offline moving behaviors (e.g., taxi traces, smart card transactions, check-ins). While a variety of features could be extracted from these data, these features are intercorrelated and...

Publication details
Date: 1 December 2015
Type: Inproceeding
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Publication details
Date: 1 December 2015
Type: Article
Yuchen Zhang, Xi Chen, Dengyong Zhou, and Michael I. Jordan

Publication details
Date: 1 December 2015
Type: Inproceeding
Badrish Chandramouli, Jonathan Goldstein, Mike Barnett, Robert DeLine, Danyel Fisher, John C. Platt, James F. Terwilliger, and John Wernsing

This paper introduces Trill – a new query processor for analytics. Trill fulfills a combination of three requirements for a query processor to serve the diverse big data analytics space: (1) Query Model: Trill is based on a tempo-relational model that enables it to handle streaming and relational queries with early results, across the latency spectrum from real-time to offline; (2) Fabric and Language Integration: Trill is architected as a high-level language library that supports...

Publication details
Date: 1 August 2015
Type: Inproceeding
Publisher: VLDB – Very Large Data Bases
Mohan Yang, Bolin Ding, Surajit Chaudhuri, and Kaushik Chakrabarti

We aim to provide table answers to keyword queries using a knowledge base. For queries referring to multiple entities, like “Washington cities population” and “Mel Gibson movies”, it is better to represent each relevant answer as a table which aggregates a set of entities or joins of entities within the same table scheme or pattern. In this paper, we study how to find highly relevant patterns in a knowledge base for user-given keyword queries to compose table answers. A knowledge base is...

Publication details
Date: 1 August 2015
Type: Inproceeding
Publisher: VLDB – Very Large Data Bases
Software engineering for education focuses on developing technologies that make programming, testing and analysis more accessible to students. This workshop explores gaming through testing, which is popular with students, and can produce data worthy of analysis. Code Hunt is an industrial strength programming game which is now open in the community and available for research.
Event details
Date: 12 July 2015
Location: Baltimore, MD
Type: Workshop
Each year, Microsoft Research sponsors a semester-long class at leading design schools. Students are asked to form interdisciplinary teams of two to four students to design a user experience prototype that solves a real-world problem. From these groups, a representative team from each school presents its work to Microsoft. This year, Design Expo is excited to align with Imagine Cup and Microsoft's 40th anniversary. The event will take place from July 28th to 31st in Redmond, Washington.
Event details
Date: 8–9 July 2015
Location: Microsoft Redmond
Type: Conference
This annual event is for PhD students in their first or second year from universities and research institutions with which Microsoft Research partners, as well as all Microsoft Research PhD Scholars. It includes a series of talks of academic interest, transferable skills talks, and poster sessions that provide invited students the opportunity to present their work to Microsoft researchers.
Event details
Date: 29 June–3 July 2015
Location: Cambridge, UK
Type: Other
Akash Lal and Shaz Qadeer

A hierarchical program is one with multiple procedures but no loops or recursion. This paper studies the problem of deciding reachability queries in hierarchical programs. This problem is fundamental to verification and most directly applicable to doing bounded reachability in programs, i.e., reachability under a bound on the number of loop iterations and recursive calls.

The usual method of deciding reachability in hierarchical programs is to first inline all procedures and then do...

Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: ACM
Pantazis Deligiannis, Jeroen Ketema, Paul Thomson, Alastair Donaldson, and Akash Lal

Programming efficient asynchronous systems is challenging because it can often be hard to express the design declaratively, or to defend against interleaving-dependent bugs such as data races and other assertion violations. Previous work has only addressed these challenges individually, either by designing a new declarative language, or a new data race detection tool, or a new testing technique. We present P#, a language for high-reliability asynchronous programming co-designed with a static analysis...

Publication details
Date: 1 June 2015
Type: Inproceeding
Publisher: ACM
Shuo Ma, Yu Zheng, and Ouri Wolfson

We proposed and developed a taxi-sharing system that accepts taxi passengers’ real-time ride requests sent from smartphones and schedules proper taxis to pick them up via ridesharing, subject to time, capacity, and monetary constraints. The monetary constraints provide incentives for both passengers and taxi drivers: passengers will not pay more compared with no ridesharing and get compensated if their travel time is lengthened due to ridesharing; taxi drivers will make money for all the detour distance...

Publication details
Date: 1 June 2015
Type: Article
Publisher: IEEE
Fuzheng Zhang, Nicholas Jing Yuan, David Wilkie, Yu Zheng, and Xing Xie

Urban transportation is an important factor in energy consumption and pollution, and is of increasing concern due to its complexity and economic significance. Its importance will only increase as urbanization continues around the world. In this paper, we explore drivers’ refueling behavior in urban areas. Compared to questionnaire-based methods of the past, we propose a complete data-driven system that pushes towards real-time sensing of individual refueling behavior and citywide petrol consumption. Our...

Publication details
Date: 1 June 2015
Type: Article
Publisher: ACM – Association for Computing Machinery
Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John Platt, Lawrence Zitnick, and Geoffrey Zweig

This paper presents a novel approach for automatically generating image descriptions: visual detectors and language models learn directly from a dataset of image captions. We use Multiple Instance Learning to train visual detectors for words that commonly occur in captions, including many different parts of speech such as nouns, verbs, and adjectives. The word detector outputs serve as conditional inputs to a maximum-entropy language model. The language model learns from a set of over 400,000 image...

Publication details
Date: 1 June 2015
Type: Article
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Philip A. Bernstein, Sudipto Das, Bailu Ding, and Markus Pilman

Scaling-out a database system typically requires partitioning the database across multiple servers. If applications do not partition perfectly, then transactions accessing multiple partitions end up being distributed, which has well-known scalability challenges. To address them, we describe a high-performance transaction mechanism that uses optimistic concurrency control on a multi-versioned tree-structured database stored in a shared log. The system scales out by adding servers, without partitioning...

Publication details
Date: 31 May 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
The Devices and Networking Summit 2015 will bring together 250 thought leaders, researchers and practitioners from academia and industry, drawn from a broad range of disciplines including computer science, engineering and design. Together we will discuss some of the key advances and challenges relating to the development and deployment of the next generation of devices and services.
Event details
Date: 11–13 May 2015
Location: Paris, France
Type: Conference
Jacob R. Lorch, Andrew Baumann, Lisa Glendenning, Dutch T. Meyer, and Andrew Warfield

Many services need to survive machine failures, but designing and deploying fault-tolerant services can be difficult and error-prone. In this work, we present Tardigrade, a system that deploys an existing, unmodified binary as a fault-tolerant service. Tardigrade replicates the service on several machines so that it continues running even when some of them fail. Yet, it keeps the service states synchronized so clients see strongly consistent results. To achieve this efficiently, we use lightweight...

Publication details
Date: 4 May 2015
Type: Inproceeding
Publisher: USENIX – Advanced Computing Systems Association
Jonathan Mace, Peter Bodik, Rodrigo Fonseca, and Madanlal Musuvathi

In distributed systems shared by multiple tenants, effective resource management is an important pre-requisite to providing quality of service guarantees. Many systems deployed today lack performance isolation and experience contention, slowdown, and even outages caused by aggressive workloads or by improperly throttled maintenance tasks such as data replication. In this work we present Retro, a resource management framework for shared distributed systems. Retro monitors per-tenant resource usage both...

Publication details
Date: 1 May 2015
Type: Inproceeding
Ryen W. White, Matthew Richardson, and Wen-tau Yih

Search systems traditionally require searchers to formulate information needs as keywords rather than in a more natural form, such as questions. Recent studies have found that Web search engines are observing an increase in the fraction of queries phrased as natural language. As part of building better search engines, it is important to understand the nature and prevalence of these intentions, and the impact of this increase on search engine performance. In this work, we show that while 10.3% of queries...

Publication details
Date: 1 May 2015
Type: Inproceeding
Publisher: ACM – Association for Computing Machinery
Publication details
Date: 1 May 2015
Type: Inproceeding
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Christopher Theisen, Kim Herzig, Patrick Morrison, Brendan Murphy, and Laurie Williams

Security testing and reviewing efforts are a necessity for software projects, but are time-consuming and expensive to apply. Identifying vulnerable code supports decision-making during all phases of software development. An approach for identifying vulnerable code is to identify its attack surface, the sum of all paths for untrusted data into and out of a system. Identifying the code that lies on the attack surface requires expertise and significant manual effort. This paper proposes an...

Publication details
Date: 1 May 2015
Type: Inproceeding
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Kim Herzig and Nachiappan Nagappan

Applying code changes to software systems and testing these code changes can be a complex task that involves many different types of software testing strategies, e.g. system and integration tests. However, not all test failures reported during code integration are hinting towards code defects. Testing large systems such as the Microsoft Windows operating system requires complex test infrastructures, which may lead to test failures caused by faulty tests and test infrastructure issues. Such false test...

Publication details
Date: 1 May 2015
Type: Inproceeding
Publisher: IEEE – Institute of Electrical and Electronics Engineers
Ali Mamdouh Elkahky, Yang Song, and Xiaodong He

Recent online services rely heavily on automatic personalization to recommend relevant content to a large number of users. This requires systems to scale promptly to accommodate the stream of new users visiting the online services for the first time. In this work, we propose a content-based recommendation system to address both the recommendation quality and the system scalability. We propose to use a rich feature set to represent users, according to their web browsing history and search queries. We use...

Publication details
Date: 1 May 2015
Type: Inproceeding
Publisher: WWW – World Wide Web Consortium (W3C)