A domain specific language for writing and analyzing string encoders and decoders.
There is some evidence that a gap exists between the neural network research and software development communities. Although neural networks have been investigated for decades, source code examples available to software developers are often incomplete, misleading, or just plain incorrect. The goal of this project is to bridge that gap by providing a series of high quality demo programs and code examples.
Pex4Fun is a browser-based teaching and learning environment targeting teachers and students for introductory to advanced programming or software engineering courses. At the core of the platform is an automated grading engine based on symbolic execution. In Pex4Fun, teachers can create virtual classrooms, customize existing courses, and publish new learning material including learning games.
Column store technology can provide very substantial performance improvements on data warehousing workloads. This project investigated how to integrate columnar storage into SQL Server. The solution adopted was to add a new index type, columnstore index, that stores data column wise instead of row wise. Columnstore indexes first shipped in SQL Server 2012 and significant enhancements will be included in the next release.
This project focuses on rural government maternal health workers in India (called Accredited Social Health Activists, or ASHAs), using a tool called ASHA Assist to help ASHAs engage their clients in persuasive discussions about various topics related to maternal health. ASHA Assist consists of interactive videos on mobile phones, covering topics related to maternal health for use in counseling their clients.
Spoken language understanding (SLU) is an emerging field in between the areas of speech processing and natural language processing. The term spoken language understanding has largely been coined for targeted understanding of human speech directed at machines. This project covers our research on SLU tasks such as domain detection, intent determination, and slot filling, using data-driven methods.
With proliferation of ubiquitous access to information, the question arises of how distracting processing information can be in social settings, especially during a face-to-face conversation. In this paper, we investigate how much information users can consume during a conversation and what information delivery mode, via audio or visual aids, helps them effectively conceal the fact that they are receiving information.
Data is all the buzz. It's being seen in everything and found everywhere. But what are the consequences of this vision of a data-rich world for those of us on the street; what impact if any does it have on our everyday experiences and with the things that matter most to us. Here, we aim to reflect on the rise of (big) data and investigate what it does mean for us, and what it could come to mean.
Automatic program verification tool for proving termination and other liveness properties
One Click Access evaluation at NTCIR
Subtopic Mining and Diversified Search evaluation at NTCIR
Evaluating summaries, ranked retrieval and sessions seamlessly
This research project in MSR SVC aims to answer the following question: Can we allow programmers to write cloud applications as though they are accessing centralized, strongly consistent data while at the same time allowing them to specify their consistency/availability/performance (CAP) requirements in terms of service-level agreements (SLAs) that are enforced by the cloud storage system at runtime?
Labs: Silicon Valley
The XCG Lab Security and Cryptography teams do development, applied research, and theoretical research in the fields of systems security and cryptography. These teams include the Cryptography Research team, the Security & Cryptography team, and the Systems Incubation team.
Labs: eXtreme Computing Group
We investigate how people's behaviour online can be characterized in terms of psychometric measurements such as the Big-5 personality traits openness, conscientiousness, extraversion, agreeableness, and neuroticism as well as general intelligence and satisfaction-with-life. We investigate patterns of Facebook usage, website preferences, query logs, and Facebook Likes and look for interesting correlations which can be used to predict users behaviours, preferences or characteristics.
In order to render a high quality, versatile 3D talking head, a stable, high frame rate AV data acquisition system is constructed. It can capture 3D position, surface orientation and albedo texture of the talking head video images along with the corresponding speech signals.
We propose a new photo-realistic, voice driven only (i.e. no linguistic info of the voice input is needed) talking head.
Two important performance metrics in collaborative systems are local and remote response times. These response times depend on three important factors: processing architecture, communication architecture, and scheduling of tasks dictated by these two architectures. We show that it is possible to create a system that improves response times by dynamically adjusting these three system parameters in response to changes to collaboration parameters.
We conducted a study comparing avatar conferencing with video and audio conferencing for work scenarios. We studied nine four-person teams using a within-subjects design that measured users’ perceptions and preferences across the three conferencing conditions.
People sometimes miss small parts of meetings and need to quickly catch up without disrupting the rest of the meeting. We developed an Accelerated Instant Replay (AIR) Conferencing system for videoconferencing that enables users to catch up on missed content while the meeting is ongoing. AIR can replay parts of the conference using four different modalities: audio, video, conversation transcript, and shared workspace.
Code Digger is a Microsoft® Visual Studio® 2012 extension that analyzes possible execution paths through your .NET code. The result is a table where each row shows a unique behavior of your code. The table helps you understand the behavior of the code, and it may also uncover hidden bugs.
In recent years the Web has evolved substantially, transforming from a place where we primarily find information to a place where we also leave, share and keep it. This presents a fresh set of challenges for the management of personal information, which include how to underpin greater awareness and more control over digital belongings and other personally meaningful content that is hosted online.
Optimus is a framework for dynamically rewriting an execution plan graph in distributed data-parallel computing at runtime. It enables optimizations that require knowledge of the semantics of the computation, such as language customizations for domain-specific computations including matrix algebra. We address several problems arising in distributed execution including data skew, dynamic data re-partitioning, unbounded iterative computations, and fault tolerance.
Labs: Silicon Valley
The great thing about large displays is their size. But their size is also the bad news - in terms of conventional interface design. Conventional UI elements may be too far to conveniently reach, or reach at all. This work is directed at exploring alternative modes of interaction which bring the interaction to the user, rather than the reverse - using various techniques and technologies. Emerging from this are new insights in how to work in natural, appropriate and engaging ways.
An increased dependence on medical imaging for patient diagnosis and treatment places new challenges upon the clinical community. Existing image processing workflows struggle to keep up with the pace at which imaging technology is developing. Microsoft Research is working with top research institutes around the world to make available data and tools and advance the state of the art in automatic analysis of medical scans.