Visual Reasoning

35 papers with code · Reasoning

Latest papers without code

CLEVR Parser: A Graph Parser Library for Geometric Learning on Language Grounded Image Scenes

19 Sep 2020

The CLEVR dataset has been used extensively for language-grounded visual reasoning in the Machine Learning (ML) and Natural Language Processing (NLP) domains.

VISUAL REASONING

A Distance-preserving Matrix Sketch

8 Sep 2020

This selection is designed to preserve relative distances as closely as possible.

FEATURE SELECTION · VISUAL REASONING

Video Captioning Using Weak Annotation

2 Sep 2020

By traversing the dependency trees, sentences are generated to train the captioning model.

VIDEO CAPTIONING · VISUAL REASONING

Few-shot Visual Reasoning with Meta-analogical Contrastive Learning

23 Jul 2020

While humans can solve a visual puzzle that requires logical reasoning by observing only a few samples, state-of-the-art deep reasoning models require training on large amounts of data to reach similar performance on the same task.

CONTRASTIVE LEARNING · VISUAL REASONING

Multi-Granularity Modularized Network for Abstract Visual Reasoning

9 Jul 2020

Abstract visual reasoning connects mental abilities to the physical world, which is a crucial factor in cognitive development.

VISUAL REASONING

Self-Segregating and Coordinated-Segregating Transformer for Focused Deep Multi-Modular Network for Visual Question Answering

25 Jun 2020

The self-segregation strategy for attention helps the model better understand and filter the information that is most helpful for answering the question, and creates diversity of visual reasoning for attention.

QUESTION ANSWERING · VISUAL QUESTION ANSWERING · VISUAL REASONING

Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning"

ICML 2020

To address this, we propose (1) a framework to isolate and evaluate the reasoning aspect of VQA separately from its perception, and (2) a novel top-down calibration technique that allows the model to answer reasoning questions even with imperfect perception.

GRAPH GENERATION · QUESTION ANSWERING · REPRESENTATION LEARNING · SCENE GRAPH GENERATION · VISUAL QUESTION ANSWERING · VISUAL REASONING

Abstract Diagrammatic Reasoning with Multiplex Graph Networks

ICLR 2020

We have tested MXGNet on two types of diagrammatic reasoning tasks, namely Diagram Syllogisms and Raven Progressive Matrices (RPM).

VISUAL REASONING

Deep Visual Reasoning: Learning to Predict Action Sequences for Task and Motion Planning from an Initial Scene Image

9 Jun 2020

This is made possible by encoding the objects of the scene in images that serve as input to the neural network, instead of in a fixed feature vector.

MOTION PLANNING · VISUAL REASONING

Structured Multimodal Attentions for TextVQA

1 Jun 2020

Most state-of-the-art (SoTA) VQA methods fail to answer these questions because of (i) poor text-reading ability; (ii) a lack of text-visual reasoning capacity; and (iii) the use of a discriminative answering mechanism instead of a generative one, which makes it hard to cover both OCR tokens and general text tokens in the final answer.

OPTICAL CHARACTER RECOGNITION · QUESTION ANSWERING · VISUAL QUESTION ANSWERING · VISUAL REASONING