Question Answering

2911 papers with code • 131 benchmarks • 362 datasets

Question Answering is the task of answering questions (typically reading comprehension questions), but abstaining when presented with a question that cannot be answered based on the provided context.

Question answering can be segmented into domain-specific tasks like community question answering and knowledge-base question answering. Popular benchmark datasets for evaluation question answering systems include SQuAD, HotPotQA, bAbI, TriviaQA, WikiQA, and many others. Models for question answering are typically evaluated on metrics like EM and F1. Some recent top performing models are T5 and XLNet.

( Image credit: SQuAD )

Libraries

Use these libraries to find Question Answering models and implementations
27 papers
125,545
5 papers
2,548
See all 11 libraries.

Latest papers with no code

Multi-hop Question Answering over Knowledge Graphs using Large Language Models

no code yet • 30 Apr 2024

Knowledge graphs (KGs) are large datasets with specific structures representing large knowledge bases (KB) where each node represents a key entity and relations amongst them are typed edges.

TableVQA-Bench: A Visual Question Answering Benchmark on Multiple Table Domains

no code yet • 30 Apr 2024

QA pairs are generated by exploiting the large language model (LLM) where the input is a text-formatted table.

When to Retrieve: Teaching LLMs to Utilize Information Retrieval Effectively

no code yet • 30 Apr 2024

Through our analysis, we demonstrate that Adapt-LLM is able to generate the <RET> token when it determines that it does not know how to answer a question, indicating the need for IR, while it achieves notably high accuracy levels when it chooses to rely only on its parametric memory.

QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering

no code yet • 30 Apr 2024

Extractive Question Answering (EQA) in Machine Reading Comprehension (MRC) often faces the challenge of dealing with semantically identical but format-variant inputs.

Neuro-Vision to Language: Image Reconstruction and Language enabled Interaction via Brain Recordings

no code yet • 30 Apr 2024

This extractor consolidates multi-level visual features into one network, simplifying integration with Large Language Models (LLMs).

Capabilities of Gemini Models in Medicine

no code yet • 29 Apr 2024

We evaluate Med-Gemini on 14 medical benchmarks, establishing new state-of-the-art (SoTA) performance on 10 of them, and surpass the GPT-4 model family on every benchmark where a direct comparison is viable, often by a wide margin.

QANA: LLM-based Question Generation and Network Analysis for Zero-shot Key Point Analysis and Beyond

no code yet • 29 Apr 2024

The proliferation of social media has led to information overload and increased interest in opinion mining.

Automated Construction of Theme-specific Knowledge Graphs

no code yet • 29 Apr 2024

Specifically, we start with an entity ontology of the theme from Wikipedia, based on which we then generate candidate relations by Large Language Models (LLMs) to construct a relation ontology.

Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities

no code yet • 27 Apr 2024

The results showed that the efficiency gained through vocabulary expansion had no negative impact on performance, except for the summarization task, and that the combined use of parallel corpora enhanced translation ability.

Transfer Learning Enhanced Single-choice Decision for Multi-choice Question Answering

no code yet • 27 Apr 2024

Multi-choice Machine Reading Comprehension (MMRC) aims to select the correct answer from a set of options based on a given passage and question.