Microsoft Research Paraphrase Corpus

This download consists of data only: a text file containing 5,800 pairs of sentences extracted from news sources on the web, along with human annotations indicating whether each pair captures a paraphrase (semantic equivalence) relationship. No more than one sentence has been extracted from any given news article. We have made a concerted effort to associate each sentence with correct information about its provenance and its author. If any attribution information is incorrect or missing, please send email to billdol@microsoft.com and we will update the file.
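
To illustrate how annotated sentence pairs like these might be read, here is a minimal Python sketch. This page does not describe the file layout, so the layout used here is an assumption: tab-separated text with a header row and five columns (label, first sentence ID, second sentence ID, first sentence, second sentence), where a label of 1 marks a paraphrase. The names SentencePair and parse_pairs are hypothetical, and the sample rows are invented for illustration; they are not taken from the corpus.

```python
import csv
import io
from typing import Iterable, List, NamedTuple


class SentencePair(NamedTuple):
    """One annotated pair (hypothetical container, not part of the download)."""
    is_paraphrase: bool
    sentence1: str
    sentence2: str


def parse_pairs(lines: Iterable[str]) -> List[SentencePair]:
    """Parse rows under the ASSUMED layout: label, id1, id2, sentence1, sentence2."""
    # QUOTE_NONE keeps quotation marks inside sentences as literal characters.
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    next(reader, None)  # skip the assumed header row
    pairs = []
    for row in reader:
        if len(row) != 5:
            continue  # skip rows that do not match the assumed layout
        pairs.append(SentencePair(row[0] == "1", row[3], row[4]))
    return pairs


# Invented sample rows, used only to exercise the parser.
sample = (
    "Quality\t#1 ID\t#2 ID\t#1 String\t#2 String\n"
    "1\t101\t102\tThe cat sat on the mat.\tA cat was sitting on the mat.\n"
    "0\t201\t202\tShares rose on Monday.\tThe company hired a new chief executive.\n"
)
pairs = parse_pairs(io.StringIO(sample))
```

To read an installed file instead, pass an open file object to parse_pairs in place of the in-memory sample, after checking the actual column layout against the files in the download.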

Download details

File Name: MSRParaphraseCorpus.msi
Version: 1.0
Date Published: 3 March 2005
Download Size: 1.30 MB

Note: By installing, copying, or otherwise using this software, you agree to be bound by the terms of its license. Read the license.
