Visual Relationship Detection

36 papers with code • 5 benchmarks • 5 datasets

Visual relationship detection (VRD) is one newly developed computer vision task aiming to recognize relations or interactions between objects in an image. It is a further learning task after object recognition and is essential for fully understanding images, even the visual world.

Spatial-Temporal Transformer for Dynamic Scene Graph Generation

yrcong/sttran ICCV 2021

Compared to the task of scene graph generation from images, it is more challenging because of the dynamic relationships between objects and the temporal dependencies between frames allowing for a richer semantic interpretation.

175
26 Jul 2021

Recovering the Unbiased Scene Graphs from the Biased Ones

coldmanck/recovering-unbiased-scene-graphs 5 Jul 2021

Given input images, scene graph generation (SGG) aims to produce comprehensive, graphical representations describing visual relationships among salient objects.

76
05 Jul 2021

2.5D Visual Relationship Detection

google-research-datasets/2.5vrd 26 Apr 2021

To enable progress on this task, we create a new dataset consisting of 220k human-annotated 2. 5D relationships among 512K objects from 11K images.

14
26 Apr 2021

RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition

Vision-CAIR/RelTransformer CVPR 2022

This paper shows that modeling an effective message-passing flow through an attention mechanism can be critical to tackling the compositionality and long-tail challenges in VRR.

28
24 Apr 2021

Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection

deeplab-ai/grounding-consistent-vrd ICCV 2021

Scene Graph Generators (SGGs) are models that, given an image, build a directed graph where each edge represents a predicted subject predicate object triplet.

8
01 Jan 2021

LIGHTEN: Learning Interactions with Graph and Hierarchical TEmporal Networks for HOI in videos

praneeth11009/LIGHTEN-Learning-Interactions-with-Graphs-and-Hierarchical-TEmporal-Networks-for-HOI 17 Dec 2020

Analyzing the interactions between humans and objects from a video includes identification of the relationships between humans and the objects present in the video.

15
17 Dec 2020

One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks

kemaloksuz/LRP-Error 21 Nov 2020

Despite being widely used as a performance measure for visual detection tasks, Average Precision (AP) is limited in (i) reflecting localisation quality, (ii) interpretability and (iii) robustness to the design choices regarding its computation, and its applicability to outputs without confidence scores.

48
21 Nov 2020

Visualization of Contributions to Open-Source Projects

onyame/Git2PROV 17 Oct 2020

We want to analyze visually, to what extend team members and external developers contribute to open-source projects.

2
17 Oct 2020

Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations

coldmanck/RVL-BERT 10 Sep 2020

Visual relationship detection aims to reason over relationships among salient objects in images, which has drawn increasing attention over the past few years.

17
10 Sep 2020

Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks

baldassarreFe/ws-vrd ECCV 2020

Visual relationship detection is fundamental for holistic image understanding.

16
16 Jun 2020