QUASAR-T (QUestion Answering by Search And Reading – Trivia)

Introduced by Dhingra et al. in Quasar: Datasets for Question Answering by Search and Reading

QUASAR-T is a large-scale dataset aimed at evaluating systems designed to comprehend a natural language query and extract its answer from a large corpus of text. It consists of 43,013 open-domain trivia questions and their answers obtained from various internet sources. ClueWeb09 serves as the background corpus for extracting these answers. The answers to these questions are free-form spans of text, though most are noun phrases.

Source: Quasar: Datasets for Question Answering by Search and Reading

Homepage