Scene Parsing

75 papers with code • 2 benchmarks • 4 datasets

Scene parsing is to segment and parse an image into different image regions associated with semantic categories, such as sky, road, person, and bed. MIT Description

Benchmarks

Add a Result

These leaderboards are used to track progress in Scene Parsing

Trend	Dataset	Best Model	Paper	Code	Compare
	PGDP5K	PGDPNet			See all
	Cityscapes test	VCD No Coarse			See all

Libraries

Use these libraries to find Scene Parsing models and implementations

PaddlePaddle/PaddleSeg

4 papers

8,256

open-mmlab/mmsegmentation

3 papers

7,408

CSAILVision/semantic-segmentation-p…

2 papers

4,836

sithu31296/semantic-segmentation

2 papers

756

See all 5 libraries.

Datasets

Subtasks

Scene Recognition

Face Parsing

Indoor Scene Synthesis

Indoor Scene Reconstruction

Scene Labeling

Street Scene Parsing

Latest papers with no code

Most implemented Social Latest No code

Feature boosting with efficient attention for scene parsing

no code yet • 29 Feb 2024

This paper presents a novel feature-boosting network that gathers spatial context from multiple levels of feature extraction and computes the attention weights for each level of representation to generate the final class labels.

Paper
Add Code

SAI3D: Segment Any Instance in 3D Scenes

no code yet • 17 Dec 2023

Advancements in 3D instance segmentation have traditionally been tethered to the availability of annotated datasets, limiting their application to a narrow spectrum of object categories.

Paper
Add Code

A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors

no code yet • 3 Dec 2023

To the best of our knowledge, there exists no unified framework that simultaneously solves the downstream high-level understanding tasks including both segmentation and detection, especially when labels are extremely limited.

Paper
Add Code

CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration

no code yet • 20 Sep 2023

In this paper, we present CaveSeg - the first visual learning pipeline for semantic segmentation and scene parsing for AUV navigation inside underwater caves.

Paper
Add Code

RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene Parsing

no code yet • 19 Sep 2023

Additionally, we release SYN-UDTIRI, the first large-scale road scene parsing dataset that contains over 10, 407 RGB images, dense depth images, and the corresponding pixel-level annotations for both freespace and road defects of different shapes and sizes.

Paper
Add Code

CACFNet: Cross-Modal Attention Cascaded Fusion Network for RGB-T Urban Scene Parsing

no code yet • journal 2023

Color–thermal (RGB-T) urban scene parsing has recently attracted widespread interest.

Paper
Add Code

EGFNet: Edge-Aware Guidance Fusion Network for RGB–Thermal Urban Scene Parsing

no code yet • journal 2023

To address these problems, an edge-aware guidance fusion network (EGFNet) was developed in this study for RGB–thermal urban scene parsing.

Paper
Add Code

Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach

no code yet • 6 Jun 2023

Video scene parsing incorporates temporal information, which can enhance the consistency and accuracy of predictions compared to image scene parsing.

Paper
Add Code

Recyclable Semi-supervised Method Based on Multi-model Ensemble for Video Scene Parsing

no code yet • 5 Jun 2023

Pixel-level Scene Understanding is one of the fundamental problems in computer vision, which aims at recognizing object classes, masks and semantics of each pixel in the given image.

Paper
Add Code

Cross-CBAM: A Lightweight network for Scene Segmentation

no code yet • 4 Jun 2023

And we propose a Cross Convolutional Block Attention Module(CCBAM), in which a cross-multiply operation is employed in the CCBAM module to make high-level semantic information guide low-level detail information.

Paper
Add Code

Scene Parsing

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result