SG-NLG (Schema-Guided Natural Language Generation)

Introduced by Du et al. in Schema-Guided Natural Language Generation

The SG-NLG dataset is a pre-processed version of the DSTC8 Schema-Guided Dialogue SGD dataset, designed specifically for data-to-text Natural Language Generation (NLG). The original DSTC8 SGD contains ~20,000 dialogues spanning across ~20 domains.

This SG-NLG dataset is designed to make it easier to conduct NLG experiments on the SGD data. It consists of pre-processed SGD data by pairing the schema for each system turn with the corresponding set of natural language strings that realize it. It also “delexicalizes” the prompts (replace related values with fixed names) to convert them into templates that make them more generic for use within a dialog system.

The final SG-NLG dataset is composed of nearly 4K MRs and over 140K templates.

Source: The Schema-Guided Natural Language Generation (SG-NLG) Dataset

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

SG-NLG (Schema-Guided Natural Language Generation)

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

RotoWire

Usage

License

Modalities

Languages

SG-NLG (Schema-Guided Natural Language Generation)

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

RotoWire

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages