Data-to-Text Generation

105 papers with code • 24 benchmarks • 22 datasets

A classic problem in natural-language generation (NLG) involves taking structured data, such as a table, as input, and producing text that adequately and fluently describes this data as output. Unlike machine translation, which aims for complete transduction of the sentence to be translated, this form of NLG is usually taken to require addressing (at least) two separate challenges: what to say, the selection of an appropriate subset of the input data to discuss, and how to say it, the surface realization of a generation.

( Image credit: Data-to-Text Generation with Content Selection and Planning )

Benchmarks

Add a Result

These leaderboards are used to track progress in Data-to-Text Generation

Dataset	Best Model	Compare
WebNLG	Control Prefixes (A1, T5-large)	See all
E2E NLG Challenge	S_1^R	See all
WebNLG Full	Control Prefixes (A1, A2, T5-large)	See all
Cleaned E2E NLG Challenge	Control Prefixes (T5-large)	See all
RotoWire (Relation Generation)	SeqPlan	See all
RotoWire	HierarchicalEncoder + NR + IR	See all
ToTTo	T5-3B	See all
XAlign	Fact-aware embedding with mT5	See all
Rotowire (Content Selection)	Hierarchical Transformer Encoder + conditional copy	See all
RotoWire (Content Ordering)	Hierarchical Transformer Encoder + conditional copy	See all
MULTIWOZ 2.1	T5-Base	See all
MLB Dataset (Relation Generation)	SeqPlan	See all
MLB Dataset	SeqPlan	See all
MLB Dataset (Content Ordering)	SeqPlan	See all
Czech Restaurant NLG	binmt	See all
MLB Dataset (Content Selection)	Force-Copy	See all
SR11Deep	Transition based Deep Input Linearization	See all
ViGGO	DataTuner_FC	See all
WebNLG en	mBART	See all
WebNLG ru	mBART	See all
E2E	self-mem + new data (random)	See all
AMR3.0	StructAdapt	See all
Wikipedia Person and Animal Dataset	Ours	See all
DART	self-mem + new data	See all

Show all 24 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Data-to-Text Generation models and implementations

UFAL-DSG/tgen

2 papers

204

Datasets

Subtasks

Latest papers with no code

Most implemented Social Latest No code

A Systematic Review of Data-to-Text NLG

no code yet • 13 Feb 2024

Relevant literature in this field on datasets, evaluation metrics, application areas, multilingualism, language models, and hallucination mitigation methods is reviewed.

Paper
Add Code

TrICy: Trigger-guided Data-to-text Generation with Intent aware Attention-Copy

no code yet • IEEE/ACM Transactions on Audio, Speech, and Language Processing 2024

Data-to-text (D2T) generation is a crucial task in many natural language understanding (NLU) applications and forms the foundation of task-oriented dialog systems.

Paper
Add Code

Beyond Reference-Based Metrics: Analyzing Behaviors of Open LLMs on Data-to-Text Generation

no code yet • 18 Jan 2024

We analyze the behaviors of open large language models (LLMs) on the task of data-to-text (D2T) generation, i. e., generating coherent and relevant text from structured data.

Paper
Add Code

Compositional Generalization for Data-to-Text Generation

no code yet • 5 Dec 2023

Data-to-text generation involves transforming structured data, often represented as predicate-argument tuples, into coherent textual descriptions.

Paper
Add Code

Prompt Optimization via Adversarial In-Context Learning

no code yet • 5 Dec 2023

We propose a new method, Adversarial In-Context Learning (adv-ICL), to optimize prompt for in-context learning (ICL) by employing one LLM as a generator, another as a discriminator, and a third as a prompt modifier.

Paper
Add Code

PixT3: Pixel-based Table To Text generation

no code yet • 16 Nov 2023

Table-to-text generation involves generating appropriate textual descriptions given structured tabular data.

Paper
Add Code

Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning

no code yet • 10 Aug 2023

We present a novel approach for structured data-to-text generation that addresses the limitations of existing methods that primarily focus on specific types of structured data.

Paper
Add Code

You Can Generate It Again: Data-to-text Generation with Verification and Correction Prompting

no code yet • 28 Jun 2023

In this paper, we propose a novel approach that goes beyond traditional one-shot generation methods by introducing a multi-step process consisting of generation, verification, and correction stages.

Paper
Add Code

ReTAG: Reasoning Aware Table to Analytic Text Generation

no code yet • 19 May 2023

The task of table summarization involves generating text that both succinctly and accurately represents the table or a specific set of highlighted cells within a table.

Paper
Add Code

Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain

no code yet • 5 May 2023

This task is non-trivial, due to three challenges: the logic of the generated text, unstructured style reference, and biased training samples.

Paper
Add Code

Data-to-Text Generation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result