CLEVR-Dialog

Introduced by Kottur et al. in CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog

CLEVR-Dialog is a large diagnostic dataset for studying multi-round reasoning in visual dialog. Specifically, that authors construct a dialog grammar that is grounded in the scene graphs of the images from the CLEVR dataset. This combination results in a dataset where all aspects of the visual dialog are fully annotated. In total, CLEVR-Dialog contains 5 instances of 10-round dialogs for about 85k CLEVR images, totaling to 4.25M question-answer pairs.

The CLEVR-Dialog is used to benchmark performance of standard visual dialog models; in particular, on visual coreference resolution (as a function of the coreference distance). This is the first analysis of its kind for visual dialog models that was not possible without this dataset.

CLEVR-Dialog is aims to help inform the development of future models for visual dialog.

Source: CLEVR-Dialog

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

satwikkottur/clevr-dialog

Tasks

Similar Datasets

Iconary

Wikidata-14M

Flickr-8k

ORGaze

CLEVR-Dialog

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

Iconary

Wikidata-14M

Flickr-8k

ORGaze

Usage

License

Modalities

Languages

CLEVR-Dialog

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

Iconary

Wikidata-14M

Flickr-8k

ORGaze

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages