Scene Graph Generation

110 papers with code • 5 benchmarks • 7 datasets

A scene graph is a structured representation of an image: nodes correspond to object bounding boxes with their object categories, and edges correspond to the pairwise relationships between those objects. The task of Scene Graph Generation is to generate a visually grounded scene graph that most accurately correlates with the image.

Source: Scene Graph Generation by Iterative Message Passing
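
The definition above can be made concrete with a small data structure. The following is a minimal sketch, not taken from any of the listed papers: the class names (ObjectNode, RelationEdge, SceneGraph), the (x_min, y_min, x_max, y_max) box convention, and the example labels are all assumptions chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Hypothetical box convention for this sketch: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class ObjectNode:
    """A node: one object bounding box together with its category."""
    category: str
    box: Box


@dataclass
class RelationEdge:
    """A directed edge: a pairwise relationship between two nodes."""
    subj: int       # index of the subject node in SceneGraph.nodes
    predicate: str  # relationship label, e.g. "riding"
    obj: int        # index of the object node in SceneGraph.nodes


@dataclass
class SceneGraph:
    nodes: List[ObjectNode] = field(default_factory=list)
    edges: List[RelationEdge] = field(default_factory=list)

    def add_node(self, category: str, box: Box) -> int:
        """Append a node and return its index."""
        self.nodes.append(ObjectNode(category, box))
        return len(self.nodes) - 1

    def add_edge(self, subj: int, predicate: str, obj: int) -> None:
        """Append an edge after checking that both endpoints exist."""
        n = len(self.nodes)
        if not (0 <= subj < n and 0 <= obj < n):
            raise IndexError("edge endpoints must refer to existing nodes")
        self.edges.append(RelationEdge(subj, predicate, obj))

    def triplets(self) -> List[Tuple[str, str, str]]:
        """Return the graph as (subject, predicate, object) category triplets."""
        return [
            (self.nodes[e.subj].category, e.predicate, self.nodes[e.obj].category)
            for e in self.edges
        ]


# Illustrative image content: a man riding a horse.
graph = SceneGraph()
man = graph.add_node("man", (10.0, 20.0, 110.0, 300.0))
horse = graph.add_node("horse", (60.0, 120.0, 400.0, 380.0))
graph.add_edge(man, "riding", horse)
print(graph.triplets())
```

Storing edges as index pairs rather than object references keeps the graph easy to serialize. A model for this task would predict both the node list and the edge list from image pixels; the sketch only covers the output representation.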

Most implemented papers

RLIPv2: Fast Scaling of Relational Language-Image Pre-training

jacobyuan7/rlipv2 ICCV 2023

In this paper, we propose RLIPv2, a fast converging model that enables the scaling of relational pre-training to large-scale pseudo-labelled scene graph data.

Panoptic Video Scene Graph Generation

jingkang50/openpvsg CVPR 2023

PVSG relates to the existing video scene graph generation (VidSGG) problem, which focuses on temporal interactions between humans and objects grounded with bounding boxes in videos.

Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning

paulgay/VGfM 16 Jul 2018

Recent approaches to visual scene understanding attempt to build a scene graph, a computational representation of objects and their pairwise relationships.

Relation Transformer Network

rajatkoner08/rtn 13 Apr 2020

In this work, we propose a novel transformer formulation for scene graph generation and relation prediction.

Learning Visual Commonsense for Robust Scene Graph Generation

ZhecanJamesWang/GLAT_SGG ECCV 2020

Scene graph generation models understand the scene through object and predicate recognition, but are prone to mistakes due to the challenges of perception in the wild.

Learning and Reasoning with the Graph Structure Representation in Robotic Surgery

mobarakol/Surgical_SceneGraph_Generation 7 Jul 2020

Learning to infer graph representations and perform spatial reasoning in a complex surgical environment can play a vital role in surgical scene understanding in robotic surgery.

SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences

ShunChengWu/SceneGraphFusion CVPR 2021

Scene graphs are a compact and explicit representation successfully used in a variety of 2D scene understanding tasks.

Fine-Grained Scene Graph Generation with Data Transfer

waxnkw/ietrans-sgg.pytorch 22 Mar 2022

Scene graph generation (SGG) is designed to extract (subject, predicate, object) triplets in images.

Scene Graph Generation from Objects, Phrases and Region Captions

yikang-li/MSDN ICCV 2017

Object detection, scene graph generation and region captioning, which are three scene understanding tasks at different semantic levels, are tied together: scene graphs are generated on top of objects detected in an image with their pairwise relationship predicted, while region captioning gives a language description of the objects, their attributes, relations, and other context information.

Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction

shikorab/SceneGraph NeurIPS 2018

Machine understanding of complex images is a key goal of artificial intelligence.