Datasets for Time-aware Information Access and Retrieval Experiments
This page is designed to be a public hub for researchers interested in sharing different tools and datasets for time-aware information access and retrieval experiments.
- Topic Tracking and news data: http://projects.ldc.upenn.edu/TDT/
- Full NYT data: http://www.nytimes.com/ref/membercenter/nytarchive.html
- Crawling datasets over time: http://people.oii.ox.ac.uk/escher/resources/web-crawling-crawl-datasets/; http://commoncrawl.org/data/
Contact: Milad Shokouhi