This corpus contains the original data analyzed in the following paper: Basu, Jacobs, and Vanderwende, "Powergrading: a Clustering Approach to Amplify Human Effort for Short Answer Grading," Transactions of the ACL, 2013. It consists of responses from two sets of crowdsourced workers (100 + 698) to each of 20 short-answer questions. The questions are taken from the 100 questions published by the United States Citizenship and Immigration Services as preparation for the citizenship test. The corpus also contains labels of response correctness (grades) from three judges for a subset of 10 questions, covering the set of 698 responses to each of those questions (3 x 6,980 = 20,940 labels).
Note: By installing, copying, or otherwise using this software, you agree to be bound by the terms of its license. Read the license.