Scene Graph Generation

110 papers with code • 5 benchmarks • 7 datasets

A scene graph is a structured representation of an image, where nodes in a scene graph correspond to object bounding boxes with their object categories, and edges correspond to their pairwise relationships between objects. The task of Scene Graph Generation is to generate a visually-grounded scene graph that most accurately correlates with an image.

Source: Scene Graph Generation by Iterative Message Passing

Libraries

Use these libraries to find Scene Graph Generation models and implementations

Most implemented papers

Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction

shikorab/SceneGraph NeurIPS 2018

Machine understanding of complex images is a key goal of artificial intelligence.

Factorizable Net: An Efficient Subgraph-based Framework for Scene Graph Generation

yikang-li/FactorizableNet ECCV 2018

Generating scene graph to describe all the relations inside an image gains increasing interests these years.

The Limited Multi-Label Projection Layer

locuslab/lml 20 Jun 2019

We propose the Limited Multi-Label (LML) projection layer as a new primitive operation for end-to-end learning systems.

Bridging Knowledge Graphs to Generate Scene Graphs

alirezazareian/gbnet ECCV 2020

Scene graphs are powerful representations that parse images into their abstract semantic elements, i. e., objects and their interactions, which facilitates visual comprehension and explainable reasoning.

Weakly Supervised Visual Semantic Parsing

alirezazareian/vspnet CVPR 2020

Scene Graph Generation (SGG) aims to extract entities, predicates and their semantic structure from images, enabling deep understanding of visual content, with many applications such as visual reasoning and image retrieval.

NODIS: Neural Ordinary Differential Scene Understanding

yrcong/NODIS ECCV 2020

Detected objects, their labels and the discovered relations can be used to construct a scene graph which provides an abstract semantic interpretation of an image.

GPS-Net: Graph Property Sensing Network for Scene Graph Generation

taksau/GPS-Net CVPR 2020

There are three key properties of scene graph that have been underexplored in recent works: namely, the edge direction information, the difference in priority between nodes, and the long-tailed distribution of relationships.

Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation

bknyaz/sgg 17 May 2020

We show that such models can suffer the most in their ability to generalize to rare compositions, evaluating two different models on the Visual Genome dataset and its more recent, improved version, GQA.

Generative Compositional Augmentations for Scene Graph Prediction

bknyaz/sgg ICCV 2021

However, test images might contain zero- and few-shot compositions of objects and relationships, e. g. <cup, on, surfboard>.

Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation

Kenneth-Wong/het-eccv20 ECCV 2020

Scene graph aims to faithfully reveal humans' perception of image content.