Scene Parsing

75 papers with code • 2 benchmarks • 4 datasets

Scene parsing is to segment and parse an image into different image regions associated with semantic categories, such as sky, road, person, and bed. MIT Description

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Parsing

Trend	Dataset	Best Model	Paper	Code	Compare
	PGDP5K	PGDPNet			See all
	Cityscapes test	VCD No Coarse			See all

Libraries

Use these libraries to find Scene Parsing models and implementations

PaddlePaddle/PaddleSeg

4 papers

8,273

open-mmlab/mmsegmentation

3 papers

7,443

CSAILVision/semantic-segmentation-p…

2 papers

4,844

sithu31296/semantic-segmentation

2 papers

762

See all 5 libraries.

Datasets

Subtasks

Scene Recognition

Face Parsing

Indoor Scene Synthesis

Indoor Scene Reconstruction

Scene Labeling

Street Scene Parsing

Latest papers with no code

Most implemented Social Latest No code

Cross-CBAM: A Lightweight network for Scene Segmentation

no code yet • 4 Jun 2023

And we propose a Cross Convolutional Block Attention Module(CCBAM), in which a cross-multiply operation is employed in the CCBAM module to make high-level semantic information guide low-level detail information.

Paper
Add Code

Treasure What You Have: Exploiting Similarity in Deep Neural Networks for Efficient Video Processing

no code yet • 10 May 2023

Deep learning has enabled various Internet of Things (IoT) applications.

Paper
Add Code

Local and Global Contextual Features Fusion for Pedestrian Intention Prediction

no code yet • 1 May 2023

The pedestrian features include body pose and local context features that represent the pedestrian's behaviour.

Paper
Add Code

Weakly Supervised Class-Agnostic Motion Prediction for Autonomous Driving

no code yet • CVPR 2023

To this end, we propose a two-stage weakly supervised approach, where the segmentation model trained with the incomplete binary masks in Stage1 will facilitate the self-supervised learning of the motion prediction network in Stage2 by estimating possible moving foregrounds in advance.

Paper
Add Code

Re:PolyWorld - A Graph Neural Network for Polygonal Scene Parsing

no code yet • ICCV 2023

Re:PolyWorld not only outperforms the original model on building extraction in aerial images, thanks to the proposed joint analysis of vertices and edges, but also beats the state-of-the-art in multiple other domains.

Paper
Add Code

Visual Traffic Knowledge Graph Generation from Scene Images

no code yet • ICCV 2023

Although previous works on traffic scene understanding have achieved great success, most of them stop at a lowlevel perception stage, such as road segmentation and lane detection, and few concern high-level understanding.

Paper
Add Code

Multi-Sem Fusion: Multimodal Semantic Fusion for 3D Object Detection

no code yet • 10 Dec 2022

Most multi-modal 3D object detection frameworks integrate semantic knowledge from 2D images into 3D LiDAR point clouds to enhance detection accuracy.

Paper
Add Code

GEBNet: Graph-Enhancement Branch Network for RGB-T Scene Parsing

no code yet • journal 2022

RGB-T (red–green–blue and thermal) scene parsing has recently drawn considerable research attention.

Paper
Add Code

Boundary Corrected Multi-scale Fusion Network for Real-time Semantic Segmentation

no code yet • 1 Mar 2022

Image semantic segmentation aims at the pixel-level classification of images, which has requirements for both accuracy and speed in practical application.

Paper
Add Code

Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling

no code yet • 6 Jan 2022

Finally, we perform ASP by unifying the tile-level scene classification and object-based image analysis to achieve pixel-wise semantic labeling.

Paper
Add Code

Scene Parsing

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result