Smart Selection Dataset

Smart selection is the task of predicting the span of text that a user intended to select after touching a single word on a touch-enabled device. The Smart Selection Dataset consists of crowd-sourced smart selection annotations on publicly available data. Specifically, we started from a collection of publicly available textbooks. We randomly sampled 100 textbooks from this collection, then randomly sampled one paragraph from each textbook. For each paragraph, we asked 100 crowd workers to select the phrases in the paragraph that they found interesting and would like to learn more about. See Pantel et al. 2014 (ACL) for a detailed description of the training and test data.


Date Published: 10 April 2014
Download Size: 10.55 MB

Note: By installing, copying, or otherwise using this software, you agree to be bound by the terms of its license. Read the license.