Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Our research
Content type
+
Downloads (461)
+
Events (468)
 
Groups (151)
+
News (2817)
 
People (717)
 
Projects (1136)
+
Publications (12933)
+
Videos (6003)
Labs
Research areas
Algorithms and theory47205 (5)
Communication and collaboration47188 (6)
Computational linguistics47189 (13)
Computational sciences47190 (24)
Computer systems and networking47191 (24)
Computer vision208594 (1)
Data mining and data management208595 (0)
Economics and computation47192 (0)
Education47193 (2)
Gaming47194 (3)
Graphics and multimedia47195 (22)
Hardware and devices47196 (10)
Health and well-being47197 (15)
Human-computer interaction47198 (23)
Machine learning and intelligence47200 (15)
Mobile computing208596 (2)
Quantum computing208597 (0)
Search, information retrieval, and knowledge management47199 (23)
Security and privacy47202 (14)
Social media208598 (0)
Social sciences47203 (3)
Software development, programming principles, tools, and languages47204 (27)
Speech recognition, synthesis, and dialog systems208599 (1)
Technology for emerging markets208600 (1)
1–25 of 461
Sort
Show 25 | 50 | 100
1234567Next 
This data set identifies 38M tweets collected for the analysis of social media messages related to the 2012 U.S. Presidential election. The data set provides tweet IDs for tweets containing the words "obama", "romney", or both (case-insensitive matching) during the period from July 1, 2012 through November 7, 2012. The paper, “Online and Social Media Data As an Imperfect Continuous Panel Survey.” PLoS ONE 11(1): e0145406 by Diaz et al. provides further description of the dataset.
Details
Date: 29 January 2016
Version: 1.0
Size: 279.02 MB
Type: Download
In addition to physical TPM devices, the TSS.MSR libraries can also connect to a TPM simulator to enable application development and debugging on platforms that do not have a TPM 2.0 device. The connection to the simulator is over a TCP/IP socket so the simulator may be running on a remote machine or in another process on the same machine. Below you will find a link to download the TPM2 Simulator binary for use with the TSS.MSR TPM2 libraries.
Details
Date: 18 December 2015
Version: 2.0
Size: 0.24 MB
Type: Download
Microsoft Hyperlapse is a new technology that creates smooth and stabilized time lapses from first-person videos. Want to show your friends what you saw on that 12-mile hike you took last weekend or let them experience how it felt to fly down the mountain on your recent ski trip? With Microsoft Hyperlapse, you can time lapse those experiences, distilling them into easily consumable, enjoyable experiences.
Details
Date: 11 December 2015
Version: 1.5
Size: 19.37 MB
Type: Download
The Microsoft Research JavaScript Cryptography Library has been developed for use with cloud services in an HTML5 compliant and forward-looking manner. The algorithms are exposed via the W3C WebCrypto interface, and are tested against the Microsoft Edge implementation of that interface. The library currently supports RSA encrypt/decrypt (PKCS#1 v1.5, OAEP, and PSS), AES-CBC and GCM encrypt/decrypt, SHA-256/384/512, HMAC with supported hash functions, PRNG (AES-CTR based) as specified by NIST, ECDH, ECDSA,...
Details
Date: 2 December 2015
Version: 1.4
Size: 255.81 MB
Type: Download
Demonstration code of the project presented at SIGGRAPH Asia 2015: a fast automatic technique for converting a short (5-second) input video to a seamless loop that can play forever without spatial or temporal artifacts.
Details
Date: 13 November 2015
Version: 1.1
Size: 32.54 MB
Type: Download
This dataset contains knowledge base relation triples and textual mentions of Freebase entity pairs, as used in the work published in (Toutanova and Chen CVSM-2015) and (Toutanova et al. EMNLP-2015). The knowledge base triples are a subset of the FB15K set (Bordes et al. NIPS-2013), originally derived from Freebase. The textual mentions are derived from 200 million sentences from the ClueWeb12 corpus coupled with FACC1 Freebase entity mention annotations. More details can be found in the included README.
Details
Date: 30 October 2015
Version: 1.0
Size: 139.42 MB
Type: Download
We are releasing the code of some models used in our EMNLP-2015 paper, “WikiQA: A Challenge Dataset for Open-Domain Question Answering.” The code includes implementation of two models: Convolutional Neural Networks (CNN), and Logistic regression with CNN and count features (CNN-Cnt). The models intend to judge whether a given sentence can be used to answer the input question. Please refer to the README.txt file in the download and also the paper for detail.
Details
Date: 30 October 2015
Version: 1.0
Size: 1.98 MB
Type: Download
DRAW is a collection of algebra word problems that are semi-automatically mined from http://algebra.com. Compared to existing evaluation datasets, DRAW is more diverse in terms of problem types, and poses a challenging evaluation setting for automatic solvers. DRAW's annotation schema also provides annotation for the alignment of the coefficients to numbers appearing in the text, which was absent in earlier datasets. For template-based approaches, we also induce the templates from the data and de-duplicate...
Details
Date: 15 October 2015
Version: 1.0
Size: 0.19 MB
Type: Download
Microsoft Hyperlapse is a new technology that creates smooth and stabilized time lapses from first-person videos. Want to show your friends what you saw on that 12-mile hike you took last weekend or let them experience how it felt to fly down the mountain on your recent ski trip? With Microsoft Hyperlapse, you can time lapse those experiences, distilling them into easily consumable, enjoyable experiences.
Details
Date: 13 October 2015
Version: 1.3
Size: 18.36 MB
Type: Download
FourQlib is an efficient and portable math library that provides functions for computing essential elliptic curve operations on a new, high-performance curve called "FourQ". This curve targets the 128-bit security level.
Details
Date: 10 September 2015
Version: 1.0
Size: 0.08 MB
Type: Download
The FingerPaint Dataset contains video-sequences of several individuals performing hand gestures, as captured by a depth camera. The ground truth locations of the fingertips are included as an annotation for each frame of the video. This dataset was developed for the paper: T. Sharp et al. "Accurate, Robust, and Flexible Real-time Hand Tracking." In Proc. CHI, vol. 8. 2015. http://research.microsoft.com/apps/pubs/default.aspx?id=238453 Users of the dataset are requested to cite this paper. Note: The...
Details
Date: 28 August 2015
Version: 1.0
Size: 2048.00 MB
Type: Download
The WikiQA corpus is a new publicly available set of question and sentence pairs, collected and annotated for research on open-domain question answering. In order to reflect the true information need of general users, we used Bing query logs as the question source. Each question is linked to a Wikipedia page that potentially has the answer. Because the summary section of a Wikipedia page provides the basic and usually most important information about the topic, we used sentences in this section as the...
Details
Date: 28 August 2015
Version: 1.0
Size: 6.53 MB
Type: Download
Sent2vec maps a pair of short text strings (e.g., sentences or query-answer pairs) to a pair of feature vectors in a continuous, low-dimensional space where the semantic similarity between the text strings is computed as the cosine similarity between their vectors in that space. sent2vec performs the mapping using the Deep Structured Semantic Model (DSSM) proposed in (Huang et al. 2013), or the DSSM with convolutional-pooling structure (CDSSM) proposed in (Shen et al. 2014; Gao et al. 2014).
Details
Date: 28 July 2015
Version: 2.0
Size: 463.50 MB
Type: Download
This download primarily contains a list of URLs with paired natural language descriptions and code, as well as a separate of those URLs into training, development, and test data. In addition, code is included to help the downloader retrieve those URLs and their contents. This data is released in support of the ACL 2015 paper entitled “Language to Code: Learning Semantic Parsers for If-This-Then-That Recipes” by Chris Quirk, Raymond Mooney, and Michel Galley.
Details
Date: 27 July 2015
Version: 1.0
Size: 2.98 MB
Type: Download
Dataset and Evaluation code for the paper: “S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking”. Releasing part of the datasets and python evaluation code used in the paper: S-MART: Novel Tree-based Structured Learning Algorithms Applied to Tweet Entity Linking, ACL 2015. Part of the data is based on the Making Sense of Microposts (#Microposts2014) Challenge (http://www.scc.lancs.ac.uk/microposts2014/challenge/index.html).
Details
Date: 2 July 2015
Version: 1.0
Size: 0.12 MB
Type: Download
This zip file contains CAD files and source code necessary to build and use an improved lens for the Oculus Rift HMD. The source code works with the Unity game engine to correct the lens distortion for the Oculus display. The lens was automatically designed by the LensFactory program developed at Microsoft Research. The optical quality is significantly better than the lenses that come with the Oculus. The lens uses off the shelf lens elements from Edmund Optics.
Details
Date: 1 July 2015
Version: 1.2
Size: 2.78 MB
Type: Download
MSR ECCLib is an efficient cryptographic library that provides functions for computing essential elliptic curve operations on a new set of high-security curves. All computations on secret data exhibit regular, constant-time execution, providing protection against timing and cache attacks. For more information, see http://research.microsoft.com/en-us/projects/nums/default.aspx.
Details
Date: 8 June 2015
Version: 2.0
Size: 0.15 MB
Type: Download
The R2 Probabilistic Programming Tool is a research project within the Programming Languages and Tools group at Microsoft Research on probabilistic programming. Our goal is to build a user friendly and scalable probabilistic programming system by employing powerful techniques from language design, program analysis and verification.
Details
Date: 1 June 2015
Version: 1.0
Size: 6.15 MB
Type: Download
Rural government maternal health workers in India are called Accredited Social Health Activists, or ASHAs. ASHA Assist is a tool designed to help ASHAs engage their clients in persuasive discussions about various topics related to maternal health. ASHA Assist consists of interactive videos on mobile phones, covering topics related to maternal health for use in counseling their clients. Currently ASHA Assist is designed to work on Java-enabled feature phones with touch screens such as the Nokia Asha 500.
Details
Date: 1 June 2015
Version: 2.1
Size: 0.19 MB
Type: Download
Plato is a C++ open-source neural network library which supports the specification of a large range of graph types, several activation functions and training losses. The library supports backpropagation and truncated BPTT, especially useful for Recurrent Neural Networks. It supports SGD, minibatched SGD and we also implemented several second-order like optimizers such as Adagrad and AdaDelta. It runs on CUDA 6.
Details
Date: 1 June 2015
Version: 1.0
Size: 15.18 MB
Type: Download
A collection of 12,696 Tweet Ids representing 4,232 three-step conversational snippets extracted from Twitter logs. Each row in the dataset represents a single context-message-response triple that has been evaluated by crowdsourced annotators as scoring an average of 4 or higher on a 5-point Likert scale measuring quality of the response in the context. The data has been randomly binned into tuning (development) and test sets, comprising 2118 and 2114 triples respectively. It is released to the natural...
Details
Date: 1 June 2015
Version: 1.0
Size: 0.11 MB
Type: Download
Fast R-CNN (Region-based Convolutional Network) is a clean and fast framework for object detection. Compared to traditional R-CNN, and its accelerated version SPPnet, Fast R-CNN trains networks using a multi-task loss in a single training stage. The multi-task loss simplifies learning and improves detection accuracy. Unlike SPPnet, all network layers can be updated during fine-tuning. We show that this difference has practical ramifications for very deep networks, such as VGG16, where mAP suffers when only...
Details
Date: 14 May 2015
Version: 1.0
Size: 6.98 MB
Type: Download
This is a dataset that can be used for training and evaluating knowledge base completion approaches for inferring missing entity type instances. We construct our training snapshot by taking the Freebase snapshot on 3rd September, 2013 and consider entities that have a link to their Wikipedia page. The development and test data consists of facts that were added to the 1st June, 2014 snapshot of Freebase. To get negative data, we make a closed world assumption treating any unobserved instance in Freebase as...
Details
Date: 18 March 2015
Version: 1.0
Size: 1043.76 MB
Type: Download
Project Colletta is an extension of the Windows UI that supports lightweight management of the user's activities through tagging.
Details
Date: 13 March 2015
Version: 3.0.1
Size: 2.22 MB
Type: Download
The Microsoft Research Cambridge demosaicing data set consists of set of raw images, and their downscaled versions which can be used for learning and evaluating demosaicing (and possibly other tasks like denoising), both in linear-space and color-space.
Details
Date: 28 February 2015
Version: 1.0
Type: Download
1–25 of 461
Sort
Show 25 | 50 | 100
1234567Next 
> Our research