(2016. 6) Squad
Submitted on 2016. 6
Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev and Percy Liang
Simple Summary
Stanford Question Answering Dataset (SQuAD), a new reading comprehension dataset consisting of 100,000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.
a large reading comprehension dataset on Wikipedia articles with crowdsourced.
107,785 question-answer pairs on 536 articles ~500 articles from Wikipedia and size 100K.
human performance: 86.8 F1, 77% exact match
answer types
Last updated