Scene Graph Generation

110 papers with code • 5 benchmarks • 7 datasets

A scene graph is a structured representation of an image, where nodes in a scene graph correspond to object bounding boxes with their object categories, and edges correspond to their pairwise relationships between objects. The task of Scene Graph Generation is to generate a visually-grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Graph Generation

Dataset	Best Model	Compare
Visual Genome	SpeaQ (without reweighting)	See all
4D-OR	ORacle	See all
VRD	FactorizableNet	See all
3R-Scan	SceneGraphFusion	See all
MS-COCO	NeuSyRE	See all

Libraries

Use these libraries to find Scene Graph Generation models and implementations

rafa-cxg/PySGG-cxg

3 papers

suprosanna/relationformer

2 papers

shikorab/SceneGraph

2 papers

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction

shikorab/SceneGraph • • NeurIPS 2018

Machine understanding of complex images is a key goal of artificial intelligence.

Paper
Code

Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation

yikang-li/FactorizableNet • • ECCV 2018

Generating scene graph to describe all the relations inside an image gains increasing interests these years.

Paper
Code

The Limited Multi-Label Projection Layer

locuslab/lml • • 20 Jun 2019

We propose the Limited Multi-Label (LML) projection layer as a new primitive operation for end-to-end learning systems.

Paper
Code

Bridging Knowledge Graphs to Generate Scene Graphs

alirezazareian/gbnet • • ECCV 2020

Scene graphs are powerful representations that parse images into their abstract semantic elements, i. e., objects and their interactions, which facilitates visual comprehension and explainable reasoning.

Paper
Code

Weakly Supervised Visual Semantic Parsing

alirezazareian/vspnet • • CVPR 2020

Scene Graph Generation (SGG) aims to extract entities, predicates and their semantic structure from images, enabling deep understanding of visual content, with many applications such as visual reasoning and image retrieval.

Paper
Code

NODIS: Neural Ordinary Differential Scene Understanding

yrcong/NODIS • • ECCV 2020

Detected objects, their labels and the discovered relations can be used to construct a scene graph which provides an abstract semantic interpretation of an image.

Paper
Code

GPS-Net: Graph Property Sensing Network for Scene Graph Generation

taksau/GPS-Net • • CVPR 2020

There are three key properties of scene graph that have been underexplored in recent works: namely, the edge direction information, the difference in priority between nodes, and the long-tailed distribution of relationships.

Paper
Code

Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation

bknyaz/sgg • • 17 May 2020

We show that such models can suffer the most in their ability to generalize to rare compositions, evaluating two different models on the Visual Genome dataset and its more recent, improved version, GQA.

Paper
Code