Data-to-Text Generation

105 papers with code • 24 benchmarks • 22 datasets

A classic problem in natural-language generation (NLG) involves taking structured data, such as a table, as input, and producing text that adequately and fluently describes this data as output. Unlike machine translation, which aims for complete transduction of the sentence to be translated, this form of NLG is usually taken to require addressing (at least) two separate challenges: what to say, the selection of an appropriate subset of the input data to discuss, and how to say it, the surface realization of a generation.

( Image credit: Data-to-Text Generation with Content Selection and Planning )

Libraries

Use these libraries to find Data-to-Text Generation models and implementations
2 papers
204

Latest papers with no code

A Systematic Review of Data-to-Text NLG

no code yet • 13 Feb 2024

Relevant literature in this field on datasets, evaluation metrics, application areas, multilingualism, language models, and hallucination mitigation methods is reviewed.

TrICy: Trigger-guided Data-to-text Generation with Intent aware Attention-Copy

no code yet • IEEE/ACM Transactions on Audio, Speech, and Language Processing 2024

Data-to-text (D2T) generation is a crucial task in many natural language understanding (NLU) applications and forms the foundation of task-oriented dialog systems.

Beyond Reference-Based Metrics: Analyzing Behaviors of Open LLMs on Data-to-Text Generation

no code yet • 18 Jan 2024

We analyze the behaviors of open large language models (LLMs) on the task of data-to-text (D2T) generation, i. e., generating coherent and relevant text from structured data.

Compositional Generalization for Data-to-Text Generation

no code yet • 5 Dec 2023

Data-to-text generation involves transforming structured data, often represented as predicate-argument tuples, into coherent textual descriptions.

Prompt Optimization via Adversarial In-Context Learning

no code yet • 5 Dec 2023

We propose a new method, Adversarial In-Context Learning (adv-ICL), to optimize prompt for in-context learning (ICL) by employing one LLM as a generator, another as a discriminator, and a third as a prompt modifier.

PixT3: Pixel-based Table To Text generation

no code yet • 16 Nov 2023

Table-to-text generation involves generating appropriate textual descriptions given structured tabular data.

Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning

no code yet • 10 Aug 2023

We present a novel approach for structured data-to-text generation that addresses the limitations of existing methods that primarily focus on specific types of structured data.

You Can Generate It Again: Data-to-text Generation with Verification and Correction Prompting

no code yet • 28 Jun 2023

In this paper, we propose a novel approach that goes beyond traditional one-shot generation methods by introducing a multi-step process consisting of generation, verification, and correction stages.

ReTAG: Reasoning Aware Table to Analytic Text Generation

no code yet • 19 May 2023

The task of table summarization involves generating text that both succinctly and accurately represents the table or a specific set of highlighted cells within a table.

Stylized Data-to-Text Generation: A Case Study in the E-Commerce Domain

no code yet • 5 May 2023

This task is non-trivial, due to three challenges: the logic of the generated text, unstructured style reference, and biased training samples.