The Value of Semantic Parse Labeling for Knowledge Base Question Answering

  • Scott Wen-tau Yih ,
  • Matthew Richardson ,
  • Chris Meek ,
  • Ming-Wei Chang ,

Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics |

Publication

We demonstrate the value of collecting semantic parse labels for knowledge base question answering. In particular, (1) unlike previous studies on small-scale datasets, we show that learning from labeled semantic parses significantly improves overall performance, resulting in absolute 5 point gain compared to learning from answers, (2) we show that with an appropriate user interface, one can obtain semantic parses with high accuracy and at a cost comparable or lower than obtaining just answers, and (3) we have created and shared the largest semantic-parse labeled dataset to date in order to advance research in question answering.

Publication Downloads

WebQuestions Semantic Parses Dataset

May 19, 2016

The WebQuestionsSP dataset is released as part of our ACL-2016 paper “The Value of Semantic Parse Labeling for Knowledge Base Question Answering”, in which we evaluated the value of gathering semantic parses, vs. answers, for a set of questions that originally comes from WebQuestions [Berant et al., 2013]. The WebQuestionsSP dataset contains full semantic parses in SPARQL queries for 4,737 questions, and “partial” annotations for the remaining 1,073 questions for which a valid parse could not be formulated or where the question itself is bad or needs a descriptive answer. This release also includes an evaluation script and the output of the STAGG semantic parsing system when trained using the full semantic parses. More detail can be found in the document and labeling instructions included in this release, as well as the paper.

MSR FastRDFStore Package – Data Release

January 2, 2017

This data release is part of the MSR FastRDFStore Package (https://github.com/Microsoft/FastRDFStore/) and includes the last dump of Freebase, as well as the processed version ready to load directly into FastRDFStore.