TDIUC (Task Directed Image Understanding Challenge)

Introduced by Kafle et al. in An Analysis of Visual Question Answering Algorithms

Task Directed Image Understanding Challenge (TDIUC) dataset is a Visual Question Answering dataset which consists of 1.6M questions and 170K images sourced from MS COCO and the Visual Genome Dataset. The image-question pairs are split into 12 categories and 4 additional evaluation matrices which help evaluate models’ robustness against answer imbalance and its ability to answer questions that require higher reasoning capability. The TDIUC dataset divides the VQA paradigm into 12 different task directed question types. These include questions that require a simpler task (e.g., object presence, color attribute) and more complex tasks (e.g., counting, positional reasoning). The dataset includes also an “Absurd” question category in which questions are irrelevant to the image contents to help balance the dataset.

Source: Question-Agnostic Attention for Visual Question Answering

Homepage

Benchmarks

Add a new result Link an existing benchmark

Trend	Task	Dataset Variant	Best Model	Paper	Code
	Visual Question Answering (VQA)	TDIUC	Accuracy

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

No data loaders found. You can submit your data loader here.

Tasks

Visual Question Answering (VQA)

Similar Datasets

MemexQA

VQA-VS

VQA-MHUG

COCO-QA

Source: https://kushalkafle.com/projects/tdiuc.html.

Usage

License

Unknown

Modalities

Images

TDIUC (Task Directed Image Understanding Challenge)

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit