Our research
The Scalable Hyperlink Store is a specialized "database" for the web graph. SHS maintains the web graph in main memory, distributed over many machines.
Details
Date: 5 February 2014
Version: 1.0.1
Size: 1.19 MB
Type: Download
Project Colletta is an extension of the Windows UI that supports lightweight management of the user's activities through tagging.
Details
Date: 22 October 2013
Version: 3.0.0
Size: 2.22 MB
Type: Download
This package implements several algorithms for language identification and includes two sets of pre-compiled language profiles. One set covers 52 languages and was trained on Wikipedia (i.e., a well-written corpus); the other covers 26 languages and was constructed from Twitter (i.e., a highly colloquial corpus). The language identifiers are packaged as a C# library and can be easily embedded into other C# projects.
Details
Date: 8 August 2013
Version: 1.0
Size: 0.09 MB
Type: Download
The primary function of this add-in is to add a few buttons to the Outlook ribbon to prevent people from replying to all the recipients of your message or forwarding it, etc. The add-in uses a facility built into Outlook and Exchange that is more lightweight than information-rights management but is not exposed in the existing UI. The add-in also includes a check for common email errors, such as omitting attachments or subject lines.
Details
Date: 2 August 2013
Version: 3.1.4
Size: 0.92 MB
Type: Download
This GPS trajectory dataset was collected in the Microsoft Research Asia GeoLife project by 182 users over a period of more than three years (from April 2007 to August 2012). Each GPS trajectory in this dataset is represented by a sequence of time-stamped points, each of which contains latitude, longitude, and altitude. The dataset contains 17,621 trajectories with a total distance of about 1.2 million kilometers and a total duration of more than 48,000 hours. These trajectories were recorded by different...
Details
Date: 9 August 2012
Version: 1.2.2
Size: 298.66 MB
Type: Download
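As a rough illustration of the trajectory representation described in the entry above (a sequence of time-stamped points with latitude, longitude, and altitude), the following Python sketch computes the total distance and duration of one trajectory. The `TrackPoint` type, the field names, and the use of the haversine formula with a mean Earth radius are assumptions made for this example; they are not taken from the dataset's own file format or tooling.

```python
import math
from dataclasses import dataclass

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an approximation


@dataclass(frozen=True)
class TrackPoint:
    """Hypothetical in-memory form of one time-stamped GPS point."""
    timestamp: float  # seconds since an arbitrary epoch
    lat: float        # degrees
    lon: float        # degrees
    alt: float        # altitude, unit as recorded by the device


def haversine_km(a: TrackPoint, b: TrackPoint) -> float:
    """Great-circle distance between two points, ignoring altitude."""
    phi1, phi2 = math.radians(a.lat), math.radians(b.lat)
    dphi = phi2 - phi1
    dlmb = math.radians(b.lon - a.lon)
    h = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def trajectory_stats(points):
    """Return (distance in km, duration in hours) for one trajectory."""
    ordered = sorted(points, key=lambda p: p.timestamp)
    if not ordered:
        return 0.0, 0.0
    distance = sum(haversine_km(p, q) for p, q in zip(ordered, ordered[1:]))
    duration_h = (ordered[-1].timestamp - ordered[0].timestamp) / 3600.0
    return distance, duration_h
```

Summing these per-trajectory values over all trajectories would give aggregate figures of the kind quoted in the description (total kilometers and total hours).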
Zentity 2.1 includes a new Resource Manager web user interface that enables users to query the database, review and update records, and create and edit relationships between items that are stored in Zentity. The Resource Manager will work with custom data models and gives users the ability to save searches for later use. Zentity 2.1 also offers an option to install a localized Spanish-language version of the software.
Details
Date: 12 December 2011
Version: 2.1
Size: 413.80 MB
Type: Download
Zentity is a research output-repository platform that provides a suite of building blocks, tools, and services that help you create and maintain your organization’s digital-library ecosystem.
Details
Date: 20 September 2011
Version: 2.1
Size: 101.40 MB
Type: Download
Zentity is a research output-repository platform that provides a suite of building blocks, tools, and services that help you create and maintain your organization’s digital-library ecosystem.
Details
Date: 20 September 2011
Version: 2.1
Size: 151.14 MB
Type: Download
Zentity is a research output-repository platform that provides a suite of building blocks, tools, and services that help you create and maintain your organization’s digital-library ecosystem.
Details
Date: 20 September 2011
Version: 2.1
Size: 101.50 MB
Type: Download
Zentity is a research output-repository platform that provides a suite of building blocks, tools, and services that help you create and maintain your organization’s digital-library ecosystem.
Details
Date: 20 September 2011
Version: 2.1
Size: 101.40 MB
Type: Download
The Query Representation and Understanding (QRU) data set contains a set of similar queries that can be used in web research tasks such as query transformation and relevance ranking. QRU contains similar queries that are related to existing benchmark data sets, such as TREC query sets. The QRU data set was created by extracting 100 TREC queries, training a query-generation model and a commercial search engine, generating similar queries from TREC queries with the model, and removing mistakenly generated...
Details
Date: 9 August 2011
Version: 1.0
Size: 0.01 MB
Type: Download
This data set is used to test various models for creating translingual document representations. We sampled 60,730 English Wikipedia articles and their Spanish counterparts and transformed each of them into a 20,000-dimensional sparse term vector. The data set does not contain the original articles, only the term vectors and the vocabulary file.
Details
Date: 8 August 2011
Version: 1.0.0
Size: 218.44 MB
Type: Download
The basic idea of AdaRank is to repeatedly construct “weak rankers” on reweighted training queries and to linearly combine the weak rankers to make ranking predictions. In learning, AdaRank minimizes a loss function defined directly on performance measures. The details of AdaRank can be found in the paper “AdaRank: A Boosting Algorithm for Information Retrieval.”
Details
Date: 11 April 2011
Version: 1.0
Size: 0.89 MB
Type: Download
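The boosting loop described in the AdaRank entry above can be sketched in a few lines of Python. This is a minimal illustration, not the code in the download: it assumes each weak ranker is a single feature, uses average precision as the performance measure, and the function names and data layout (`queries` as a list of `(docs, labels)` pairs) are hypothetical.

```python
import math


def average_precision(scores, labels):
    """Average precision of the ranking induced by sorting scores descending."""
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits, total = 0, 0.0
    for rank, i in enumerate(order, start=1):
        if labels[i]:
            hits += 1
            total += hits / rank
    return total / hits if hits else 0.0


def adarank(queries, n_features, rounds=5, measure=average_precision):
    """queries: list of (docs, labels); docs is a list of feature vectors."""
    m = len(queries)
    weights = [1.0 / m] * m        # distribution over training queries
    alphas = [0.0] * n_features    # coefficients of the linear combination

    def combined(docs):
        return [sum(a * x for a, x in zip(alphas, d)) for d in docs]

    for _ in range(rounds):
        # Weak ranker: the single feature with the best weighted performance.
        candidates = []
        for k in range(n_features):
            e = [measure([d[k] for d in docs], labels) for docs, labels in queries]
            candidates.append((sum(w * ei for w, ei in zip(weights, e)), k, e))
        _, best, e = max(candidates, key=lambda t: t[:2])

        num = sum(w * (1 + ei) for w, ei in zip(weights, e))
        den = sum(w * (1 - ei) for w, ei in zip(weights, e))
        if den <= 0:               # weak ranker is already perfect on every query
            alphas[best] += 1.0
            break
        alphas[best] += 0.5 * math.log(num / den)

        # Reweight queries: poorly ranked queries get more weight next round.
        losses = [math.exp(-measure(combined(docs), labels)) for docs, labels in queries]
        z = sum(losses)
        weights = [loss / z for loss in losses]
    return alphas
```

The reweighting step is what ties the loss to the performance measure: queries on which the current combined ranker scores low under `measure` receive exponentially more weight in the next round.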
This download is provided for the purpose of the Speller Challenge. This is a development dataset based on the publicly available TREC queries (2008 Million Query Track). Queries are annotated by using the same guidelines and processes as in the creation of the Bing Test Dataset.
Details
Date: 14 January 2011
Version: 1.0
Size: 0.32 MB
Type: Download
Pivot is an experimental application for exploring large data sets with smooth visual interactions. The application was originally released by Microsoft Live Labs in October 2009, and it is being re-released by Microsoft Research to enable the research community to continue to use it for experiments. If you have Internet Explorer 9 installed, disable GPU rendering in Internet Explorer to enable Pivot to work correctly. The Pivot collection home page points to content no longer available, but Pivot still...
Details
Date: 17 December 2010
Version: CTP1
Size: 27.78 MB
Type: Download
Diff-IE is a prototype Internet Explorer add-on that highlights the changes to a page since the last time you visited it and enables you to view and compare previously cached versions of the page.
Details
Date: 7 December 2010
Version: 2.0.1046.0
Size: 4.76 MB
Type: Download
This is a .NET assembly with a PowerShell front end to enable interactive physical-design tuning sessions over SQL Server databases.
Details
Date: 30 August 2010
Version: 0.1.2.0
Size: 0.31 MB
Type: Download
We offer a collection of common information-retrieval tools written in the DryadLINQ data parallel language. The tools are useful to the information-retrieval practitioner and instructive in the use of DryadLINQ.
Details
Date: 16 July 2010
Version: 1.0
Size: 0.17 MB
Type: Download
This GPS trajectory dataset was collected in the Microsoft Research Asia GeoLife project by 165 users over a period of more than two years (from April 2007 to August 2009). The trajectories were recorded by different GPS loggers and GPS phones and have a variety of sampling rates. 95 percent of the trajectories are logged in a dense representation, e.g., one point every 2 to 5 seconds or every 5 to 10 meters, while a few are less dense because of device constraints. This dataset...
Details
Date: 24 May 2010
Version: 1.0
Size: 331.86 MB
Type: Download
Privacy Integrated Queries (PINQ) is a LINQ-like API for writing programs against sensitive data sets while providing differential privacy guarantees for the underlying records. This first release provides the PINQ infrastructure and several example data-analysis applications, and it should be suitable for prototyping many differentially private data analyses.
Details
Date: 18 August 2009
Version: 0.1.1
Size: 0.27 MB
Type: Download
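To give a sense of what a differential privacy guarantee for a query involves, the Python sketch below answers a counting query with Laplace noise and tracks a privacy budget. It is not the PINQ API (PINQ is a C#/LINQ-style library); the `PrivateCounter` class, its method names, and the budget-accounting scheme are hypothetical and chosen only for illustration.

```python
import random


class PrivateCounter:
    """Hypothetical helper: noisy counts over records under a privacy budget."""

    def __init__(self, records, budget, rng=None):
        self._records = list(records)
        self._budget = float(budget)          # total epsilon available
        self._rng = rng or random.Random()

    @property
    def remaining_budget(self):
        return self._budget

    def noisy_count(self, predicate, epsilon):
        """Return the count of matching records plus Laplace(1/epsilon) noise."""
        if epsilon <= 0:
            raise ValueError("epsilon must be positive")
        if epsilon > self._budget:
            raise RuntimeError("privacy budget exhausted")
        self._budget -= epsilon

        true_count = sum(1 for r in self._records if predicate(r))
        # A count changes by at most 1 when one record is added or removed
        # (sensitivity 1), so the noise scale is 1/epsilon. The difference of
        # two independent Exp(rate=epsilon) draws is Laplace with that scale.
        noise = self._rng.expovariate(epsilon) - self._rng.expovariate(epsilon)
        return true_count + noise
```

A smaller `epsilon` adds more noise and spends less of the budget; once the budget is used up, further queries are refused rather than answered.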
Our library can support basic features for site analysis, such as site-map building, forum-page structuralization, URL-pattern generation, and page random sampling.
Details
Date: 27 July 2009
Version: 1.0.000
Size: 0.11 MB
Type: Download
A collection of short programs to compute standard information-retrieval performance measures—Recall, Precision, F-measure, Mean Average Precision, Mean Reciprocal Rank, Normalized Discounted Cumulative Gain—in the presence of tied scores.
Details
Date: 2 May 2007
Version: 1
Size: 0.02 MB
Type: Download
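Tied scores matter for the measures named in the entry above because the relative order of tied documents is arbitrary, so a rank-based measure is not well defined for them. One way to handle this, shown in the Python sketch below, is to average the measure over every ordering that is consistent with the scores. This brute-force version is only an illustration for small inputs and is not necessarily how the download computes its results; the function names are hypothetical.

```python
from itertools import groupby, permutations, product


def reciprocal_rank(labels):
    """1/rank of the first relevant item in a ranked list of booleans."""
    for rank, relevant in enumerate(labels, start=1):
        if relevant:
            return 1.0 / rank
    return 0.0


def expected_measure(scored, measure):
    """Average `measure` over all orderings consistent with the scores.

    scored: list of (score, relevant) pairs; higher scores rank first.
    """
    ordered = sorted(scored, key=lambda p: -p[0])
    # Group items with equal scores; only items inside a group may be permuted.
    groups = [[rel for _, rel in grp] for _, grp in groupby(ordered, key=lambda p: p[0])]
    total, count = 0.0, 0
    for combo in product(*(permutations(g) for g in groups)):
        ranking = [rel for grp in combo for rel in grp]
        total += measure(ranking)
        count += 1
    return total / count
```

For example, with one irrelevant result scored 0.9 and a relevant and an irrelevant result tied at 0.5, the relevant item is equally likely to land at rank 2 or rank 3, so the expected reciprocal rank is the mean of 1/2 and 1/3.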