A tool that analyze the Arabic text and generates the parse tree
Labs: ATL Cairo
This tool converts the colloquial Arabic to modern standard Arabic.
Labs: ATL Cairo
Computer Aided Language Learning (CALL)
NewsPad is a collaborative news editor designed to empower small communities to write articles collaboratively through: community sourcing, structured stories, and the ability to embed the story anywhere.
Labs: FUSE Labs
Site under construction
We develop novel eye-gaze tracking technologies in order to make eye-gaze tracking technology ubiquitously available for improved natural user interaction (NUI).
ViiBoard uses vision techniques to significantly enhance the user experience on large touch displays (e.g. Microsoft Perceptive Pixels) in two directions: human computer interaction and immersive remote collaboration. the first
If search and Twitter data are to be treated as a survey, they would follow a very peculiar methodology: participation is a time-varying, demographically biased sample of the population, participants are effectively continuously answering different “survey” questions, and, finally, participants can choose how often they are allowed to answer the question. In response, we show alternative methods for interpreting and using online and social media data fruitfully.
Alternating minimization is a popular approach to solve several optimization problems. In this work, we explore theoretical properties of this method (and its variants) for several non-convex optimization problems that feature prominently in several important areas such as recommendation systems, compressive sensing, computer vision etc.
Eventful is a system that helps produce news reports by recruiting and guiding remote and locative crowd workers who attend events in person to perform information collection missions. Eventful explores and hopes to problematize the concept of journalism as a service. For initial details please read the short paper below. Full paper coming soon.
Labs: FUSE Labs
This project targets on using automatic techniques to reduce MTTR of large-scale online service systems.
We are exploring ways to reconcile (national) security and privacy through the use of advanced cryptography.
We built the Sketch2Cartoon system, which is an automatic cartoon making system. It enables users to sketch major curves of characters and props in their mind, and real-time search results from millions of clipart images could be selected to compose the cartoon images. The selected com- ponents are vectorized and thus could be further edited. By enabling sketch-based input, even a child who is too young to read or write can draw whatever he/she imagines and get interesting cartoon images.
We built the Sketch2Tag system for hand-drawn sketch recognition. Due to large variations presented in hand-drawn sketches, most of existing work was limited to a particular domain or limited pre-defined classes. Different from existing work, Sketch2Tag is a general sketch recognition system, towards recognizing any semantically meaningful object that a child can recognize. This system enables a user to draw a sketch on the query panel, and then provides real-time recognition results.
Microsoft Research in partnership with Bing is happy to launch the second MSR-Bing Challenge on Image Retrieval. Do you have what it takes to build the best image retrieval system? Enter the MSR-Bing Image Retrieval Challenge in ACM Multimedia and/or ICME to develop an image scoring system for a search query.
Using a diversity of big data to infer and predict fine-grained air quality throughout a city, and finally tackle air pollutions.
Sequence Error (SE) Minimization Training of Neural Network for Voice Conversion
Project CodaLab is an open source platform that empowers communities to explore experiments together and create competitions designed to advance the state-of-the-art in machine learning.
Conversational systems interact with people through language to assist, enable, or entertain. Research at Microsoft spans dialogs that use language exclusively, or in conjunctions with additional modalities like gesture; where language is spoken or in text; and in a variety of settings, such as conversational systems in apps or devices, and situated interactions in the real world.
We argue that the massive amount of click data from commercial search engines provides a data set that is unique in the bridging of the semantic and intent gap. Search engines generate millions of click data (a.k.a. image-query pairs), which provide almost "unlimited" yet strong connections between semantics and images, as well as connections between users' intents and queries. This site is to introduce such as dataset, Clickture.
MODIST is a practical software model checker for unmodified concurrent, distributed and cloud systems. MODIST explores different execution paths systematically as well as simulating a variety of environment faults to discover subtle corner-case defects. We have applied MODIST in Oracle Berkely DB, MPS(Paxos implementation), SQL Azure, Windows Azure Storage and other real systems, and found many new bugs.
Programming today involves editing code while also running it in our head. To augment this mental simulation, live programming promises for much more fluid feedback between the programmer and a program that is executing while it is being edited.
A Bright Minds Internship is your opportunity to work on real-world projects alongside some of the brightest minds in computer science at Microsoft Research Cambridge.
This is the website of the rack-scale computing research project at MSRC