Open-Domain Question Answering
197 papers with code • 15 benchmarks • 26 datasets
Open-domain question answering is the task of answering questions against a large open-domain knowledge source such as Wikipedia, rather than a pre-selected passage that is known to contain the answer.
Libraries
Use these libraries to find Open-Domain Question Answering models and implementations
Latest papers
Beyond Memorization: The Challenge of Random Memory Access in Language Models
Through carefully designed synthetic tasks covering full recitation, selective recitation, and grounded question answering, we reveal that LMs can access their memory sequentially but encounter challenges when randomly accessing memorized content.
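For concreteness, here is a toy generator for the three probe settings named above (an illustrative assumption, not the paper's benchmark): full recitation needs only sequential access, while the other two require random access to one memorized item.

```python
# Toy probes for the three settings above: the model first memorizes numbered
# facts, then is asked to (1) recite all of them in order, (2) recite one
# selected at random, or (3) answer a question grounded in one of them.
# The fact template and task phrasing are illustrative assumptions.
import random

def make_probes(n_facts: int = 5, seed: int = 0):
    rng = random.Random(seed)
    facts = [f"Item {i}: code-{rng.randint(1000, 9999)}" for i in range(n_facts)]
    k = rng.randrange(n_facts)
    return {
        "memorize": "\n".join(facts),
        "full_recitation": "Recite all items in order.",    # sequential access
        "selective_recitation": f"Recite item {k} only.",   # random access
        "grounded_qa": f"What is the code in item {k}?",    # random access
    }

print(make_probes())
```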
REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering
By combining improvements in both architecture and training, our proposed REAR can better utilize external knowledge by effectively perceiving the relevance of retrieved documents.
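A minimal sketch of the relevance-aware idea under stated assumptions: score each retrieved document for relevance to the question, keep only documents above a threshold, and let the generator fall back to its parametric knowledge when nothing qualifies. The `score_relevance` and `generate` callables are assumed interfaces, not REAR's components.

```python
# Relevance-gated retrieval-augmented generation: a sketch of the general
# idea described above, not REAR's actual architecture.
from typing import Callable, List

def relevance_aware_answer(
    question: str,
    docs: List[str],
    score_relevance: Callable[[str, str], float],  # relevance score in [0, 1]
    generate: Callable[[str, List[str]], str],     # (question, contexts) -> answer
    threshold: float = 0.5,
) -> str:
    scored = [(score_relevance(question, d), d) for d in docs]
    kept = [d for s, d in sorted(scored, key=lambda x: x[0], reverse=True)
            if s >= threshold]
    # An empty `kept` list means the generator answers from parametric memory.
    return generate(question, kept)
```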
RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering
Based on our findings, we propose Time-Aware Adaptive Retrieval (TA-ARE), a simple yet effective method that helps LLMs assess the necessity of retrieval without calibration or additional training.
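The sketch below shows one way an adaptive-retrieval gate can work without calibration or extra training: ask the model whether it can answer on its own, and retrieve only if it declines. The `ask_llm` and `retrieve` callables and the prompt wording are illustrative assumptions, not TA-ARE itself.

```python
# Hypothetical adaptive-retrieval gate: query the LLM about whether it needs
# external evidence before paying the cost of retrieval.
from typing import Callable, List

def adaptive_answer(
    question: str,
    ask_llm: Callable[[str], str],          # returns the model's text reply
    retrieve: Callable[[str], List[str]],   # returns candidate passages
) -> str:
    probe = (
        "Can you answer the following question from your own knowledge, "
        "without looking anything up? Reply YES or NO.\n"
        f"Question: {question}"
    )
    if ask_llm(probe).strip().upper().startswith("YES"):
        # Parametric-only path: no retrieval call is made.
        return ask_llm(f"Answer concisely: {question}")
    # Retrieval-augmented path: prepend retrieved passages as context.
    context = "\n".join(retrieve(question)[:3])
    return ask_llm(f"Context:\n{context}\n\nAnswer concisely: {question}")
```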
Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision
Cross-lingual question answering (CLQA) is a complex problem, comprising cross-lingual retrieval from a multilingual knowledge base, followed by answer generation either in English or the query language.
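The described pipeline decomposes naturally into two stages, sketched below with assumed interfaces rather than any specific system.

```python
# Illustrative two-stage CLQA pipeline: cross-lingual retrieval from a
# multilingual corpus, then answer generation in a target language.
from typing import Callable, List

def clqa_answer(
    query: str,
    retrieve_multilingual: Callable[[str, int], List[str]],  # query -> passages
    generate: Callable[[str, List[str], str], str],  # (q, contexts, lang) -> answer
    answer_lang: str = "en",  # English or the query language
) -> str:
    passages = retrieve_multilingual(query, 5)  # passages may be in any language
    return generate(query, passages, answer_lang)
```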
VerAs: Verify then Assess STEM Lab Reports
With STEM education's increasing focus on critical thinking, science writing plays an ever more important role in curricula that stress inquiry skills.
Can AI Assistants Know What They Don't Know?
To answer this question, we construct a model-specific "I don't know" (Idk) dataset for an assistant, which contains its known and unknown questions, based on existing open-domain question answering datasets.
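One simple way such a model-specific split can be constructed, as a sketch under loose exact-match grading rather than the paper's exact protocol: label a question "known" if the assistant's answer contains the gold answer, and "unknown" otherwise.

```python
# Sketch of building a model-specific "I don't know" (Idk) split from an
# existing QA dataset. Substring grading and the `ask_llm` interface are
# simplifying assumptions for illustration.
from typing import Callable, Dict, List, Tuple

def build_idk_dataset(
    qa_pairs: List[Tuple[str, str]],   # (question, gold_answer)
    ask_llm: Callable[[str], str],
) -> Dict[str, List[str]]:
    split: Dict[str, List[str]] = {"known": [], "unknown": []}
    for question, gold in qa_pairs:
        predicted = ask_llm(question).strip().lower()
        bucket = "known" if gold.strip().lower() in predicted else "unknown"
        split[bucket].append(question)
    return split
```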
Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization
Hard negative sampling, which is commonly used to improve contrastive learning, can introduce more noise in training.
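For background, the sketch below shows an InfoNCE-style contrastive loss of the kind used to train dense retrievers, plus one crude mitigation: dropping negatives that score almost as high as the positive, since they may be unlabeled positives. This illustrates the problem setting only; it is not the paper's confidence regularizer.

```python
# InfoNCE-style contrastive loss with a hand-rolled false-negative filter.
# Negatives within `margin` of the positive score are excluded from the
# partition function as suspected false negatives (an assumption for
# illustration, not the proposed regularization).
import math
from typing import List

def info_nce(pos_score: float, neg_scores: List[float], margin: float = 0.0) -> float:
    kept = [s for s in neg_scores if s < pos_score - margin]
    denom = math.exp(pos_score) + sum(math.exp(s) for s in kept)
    return -math.log(math.exp(pos_score) / denom)

print(info_nce(2.0, [0.5, 1.9, -1.0], margin=0.2))  # 1.9 is filtered out
```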
Learning to Filter Context for Retrieval-Augmented Generation
To alleviate these problems, we propose FILCO, a method that improves the quality of the context provided to the generator by (1) identifying useful context based on lexical and information-theoretic approaches, and (2) training context filtering models that can filter retrieved contexts at test time.
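A minimal sketch of step (1), lexical context filtering: rank retrieved sentences by word overlap with the question and keep the top few. The overlap measure and cutoff are illustrative assumptions, not FILCO's trained filter.

```python
# Lexical context filtering: keep the retrieved sentences whose vocabulary
# overlaps most with the question, discarding the rest before generation.
from typing import List

def filter_context(question: str, sentences: List[str], top_k: int = 3) -> List[str]:
    q_words = set(question.lower().split())

    def overlap(sent: str) -> float:
        s_words = set(sent.lower().split())
        return len(q_words & s_words) / max(len(s_words), 1)

    return sorted(sentences, key=overlap, reverse=True)[:top_k]
```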
Detrimental Contexts in Open-Domain Question Answering
However, counter-intuitively, too much context can have a negative impact on the model when evaluated on common question answering (QA) datasets.
Knowledge Corpus Error in Question Answering
This error arises when the knowledge corpus used for retrieval is only a subset of the entire string space, potentially excluding more helpful passages that exist outside the corpus.