e-ViL is a benchmark for explainable vision-language tasks. It spans three datasets of human-written natural language explanations (NLEs): e-SNLI-VE, VCR, and VQA-X. It also provides a unified evaluation framework designed to be reusable in future work.