NewsPad is a collaborative news editor designed to empower small communities to write articles collaboratively through: community sourcing, structured stories, and the ability to embed the story anywhere.
Labs: FUSE Labs
Site under construction
We develop novel eye-gaze tracking technologies in order to make eye-gaze tracking technology ubiquitously available for improved natural user interaction (NUI).
ViiBoard uses vision techniques to significantly enhance the user experience on large touch displays (e.g. Microsoft Perceptive Pixels) in two directions: human computer interaction and immersive remote collaboration. the first
3D scanning on portable device is a technology that empower people turn physical objects into high-quality 3D models with the normal RGB camera capturing on mobile device and support for 3D printing. The Skynet UI design aims to enhance user experience to lower the barriers between common consumers and 3D reconstruction.
This presents next-generation webcam hardware and software prototype which captures stereo video and high-accuracy depth images simultaneously in order to support some near-field Kinect scenarios on pc with non-touch screen. Showcases here are finger-controlled windows 8 UI and slingshot game.
Phone-to-Phone Mobile Motion Games (MMG) represent interesting and novel gameplay for ad hoc users in the same location. Seeking to offer Kinect-like experiences using only mobile devices, leading to new mobile gaming experiences. Key technical achievements include development of fast, accurate and robust phone-to-phone distance measurement system, and a 3D phone-to-phone measurement techniques. Several innovative games have been prototyped.
If search and Twitter data are to be treated as a survey, they would follow a very peculiar methodology: participation is a time-varying, demographically biased sample of the population, participants are effectively continuously answering different “survey” questions, and, finally, participants can choose how often they are allowed to answer the question. In response, we show alternative methods for interpreting and using online and social media data fruitfully.
TerminAnt is a conceptual AR game on Windows Phone. This game puts street cars from the real urban environment against virtual ants on mobile screen, the user could simply point mobile device at a source of real-time street traffic, based on GPRS equipped phone’s data and car’s speed motion detection data, to help create the database of dynamic traffic information collection.
Alternating minimization is a popular approach to solve several optimization problems. In this work, we explore theoretical properties of this method (and its variants) for several non-convex optimization problems that feature prominently in several important areas such as recommendation systems, compressive sensing, computer vision etc.
Eventful is a system that helps produce news reports by recruiting and guiding remote and locative crowd workers who attend events in person to perform information collection missions. Eventful explores and hopes to problematize the concept of journalism as a service. For initial details please read the short paper below. Full paper coming soon.
Labs: FUSE Labs
FlexStroke is a pen-based input device with a deformable tip that simulates several real paint brush sensations, providing new digital drawing experience for users with both visual and tactile feedback. It is configured as a Chinese brush, an oil brush, and a crayon through changing the jamming tip. This tip has different levels of stiffness based on its jamming structure.
With RoCuModel, common user is able to design a sketchy model from the scratch by hand shaping a tangible curve which can be detected by IR camera in order to generate 3D geometric model file and also directly preview the 3D volumetric image of the model in real before sending to the 3D printer. As a result, to simplify the process of personal fabrication.
This project targets on using automatic techniques to reduce MTTR of large-scale online service systems.
We are exploring ways to reconcile (national) security and privacy through the use of advanced cryptography.
We built the Sketch2Cartoon system, which is an automatic cartoon making system. It enables users to sketch major curves of characters and props in their mind, and real-time search results from millions of clipart images could be selected to compose the cartoon images. The selected com- ponents are vectorized and thus could be further edited. By enabling sketch-based input, even a child who is too young to read or write can draw whatever he/she imagines and get interesting cartoon images.
We built the Sketch2Tag system for hand-drawn sketch recognition. Due to large variations presented in hand-drawn sketches, most of existing work was limited to a particular domain or limited pre-defined classes. Different from existing work, Sketch2Tag is a general sketch recognition system, towards recognizing any semantically meaningful object that a child can recognize. This system enables a user to draw a sketch on the query panel, and then provides real-time recognition results.
Microsoft Research in partnership with Bing is happy to launch the second MSR-Bing Challenge on Image Retrieval. Do you have what it takes to build the best image retrieval system? Enter the MSR-Bing Image Retrieval Challenge in ACM Multimedia and/or ICME to develop an image scoring system for a search query.
Using a diversity of big data to infer and predict fine-grained air quality throughout a city, and finally tackle air pollutions.
Sequence Error (SE) Minimization Training of Neural Network for Voice Conversion
Project CodaLab is an open source platform that empowers communities to explore experiments together and create competitions designed to advance the state-of-the-art in machine learning.
Conversational systems interact with people through language to assist, enable, or entertain. Research at Microsoft spans dialogs that use language exclusively, or in conjunctions with additional modalities like gesture; where language is spoken or in text; and in a variety of settings, such as conversational systems in apps or devices, and situated interactions in the real world.
We argue that the massive amount of click data from commercial search engines provides a data set that is unique in the bridging of the semantic and intent gap. Search engines generate millions of click data (a.k.a. image-query pairs), which provide almost "unlimited" yet strong connections between semantics and images, as well as connections between users' intents and queries. This site is to introduce such as dataset, Clickture.
MODIST is a practical software model checker for unmodified concurrent, distributed and cloud systems. MODIST explores different execution paths systematically as well as simulating a variety of environment faults to discover subtle corner-case defects. We have applied MODIST in Oracle Berkely DB, MPS(Paxos implementation), SQL Azure, Windows Azure Storage and other real systems, and found many new bugs.
Programming today involves editing code while also running it in our head. To augment this mental simulation, live programming promises for much more fluid feedback between the programmer and a program that is executing while it is being edited.