Panoptic Segmentation

213 papers with code • 24 benchmarks • 32 datasets

Panoptic Segmentation is a computer vision task that combines semantic segmentation and instance segmentation to provide a comprehensive understanding of the scene. The goal of panoptic segmentation is to segment the image into semantically meaningful parts or regions, while also detecting and distinguishing individual instances of objects within those regions. In a given image, every pixel is assigned a semantic label, and pixels belonging to "things" classes (countable objects with instances, like cars and people) are assigned unique instance IDs. ( Image credit: Detectron2 )

Benchmarks

Add a Result

These leaderboards are used to track progress in Panoptic Segmentation

Dataset	Best Model	Compare
COCO test-dev	Mask DINO (single scale)	See all
Cityscapes val	OneFormer (ConvNeXt-L, single-scale, 512x1024, Mapillary Vistas-pretrained)	See all
COCO minival	OneFormer (InternImage-H, emb_dim=1024, single-scale)	See all
ADE20K val	OneFormer (InternImage-H, emb_dim=256, single-scale, 896x896)	See all
Mapillary val	OneFormer (DiNAT-L, single-scale)	See all
Cityscapes test	OneFormer (ConvNeXt-L, single-scale, Mapillary Vistas-Pretrained)	See all
LaRS	Mask2Former (Swin-B)	See all
S3DIS Area5	SuperCluster	See all
KITTI Panoptic Segmentation	EfficientPS	See all
Indian Driving Dataset	EfficientPS	See all
ScanNetV2	OneFormer3D	See all
ScanNet	OneFormer3D	See all
PASTIS	Exchanger+Mask2Former	See all
SemanticKITTI	P3Former	See all
PanNuke	CellViT-SAM-H	See all
COCO panoptic	VAN-B6*	See all
NYU Depth v2	EMSANet	See all
SUN-RGBD	EMSANet	See all
Panoptic nuScenes val	PolarSeg-Panoptic	See all
Panoptic nuScenes test	(AF)2-S3Net + CenterPoint	See all
PASTIS-R	Early Fusion	See all
S3DIS	SuperCluster	See all
KITTI-360	SuperCluster	See all
DALES	SuperCluster	See all

Show all 24 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Panoptic Segmentation models and implementations

open-mmlab/mmdetection

9 papers

27,716

huggingface/transformers

7 papers

124,593

google-research/deeplab2

7 papers

985

PaddlePaddle/PaddleDetection

5 papers

12,034

See all 15 libraries.

Datasets

Subtasks

Video Panoptic Segmentation

Latest papers with no code

Most implemented Social Latest No code

Generalizable Entity Grounding via Assistance of Large Language Model

no code yet • 4 Feb 2024

In this work, we propose a novel approach to densely ground visual entities from a long caption.

Paper
Add Code

UrbanGenAI: Reconstructing Urban Landscapes using Panoptic Segmentation and Diffusion Models

no code yet • 25 Jan 2024

In contemporary design practices, the integration of computer vision and generative artificial intelligence (genAI) represents a transformative shift towards more interactive and inclusive processes.

Paper
Add Code

MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty

no code yet • 23 Jan 2024

Achieving level-5 driving automation in autonomous vehicles necessitates a robust semantic visual perception system capable of parsing data from different sensors across diverse conditions.

Paper
Add Code

Self-supervised Learning of LiDAR 3D Point Clouds via 2D-3D Neural Calibration

no code yet • 23 Jan 2024

First, we propose the learnable transformation alignment to bridge the domain gap between image and point cloud data, converting features into a unified representation space for effective comparison and matching.

Paper
Add Code

UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

no code yet • 12 Jan 2024

Vision-language foundation models, represented by Contrastive language-image pre-training (CLIP), have gained increasing attention for jointly understanding both vision and textual tasks.

Paper
Add Code

CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians with Dual Feature Fusion

no code yet • 11 Jan 2024

We propose Compact and Swift Segmenting 3D Gaussians(CoSSegGaussians), a method for compact 3D-consistent scene segmentation at fast rendering speed with only RGB images input.

Paper
Add Code

3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation

no code yet • 4 Jan 2024

3D panoptic segmentation is a challenging perception task, especially in autonomous driving.

Paper
Add Code

EfficientPPS: Part-aware Panoptic Segmentation of Transparent Objects for Robotic Manipulation

no code yet • 21 Dec 2023

The use of autonomous robots for assistance tasks in hospitals has the potential to free up qualified staff and im-prove patient care.

Paper
Add Code

Beyond the Label Itself: Latent Labels Enhance Semi-supervised Point Cloud Panoptic Segmentation

no code yet • 13 Dec 2023

Second, in the Image Branch, we propose the Instance Position-scale Learning (IPSL) Module to learn and fuse the information of instance position and scale, which is from a 2D pre-trained detector and a type of latent label obtained from 3D to 2D projection.

Paper
Add Code

Digital Histopathology with Graph Neural Networks: Concepts and Explanations for Clinicians

no code yet • 4 Dec 2023

To address the challenge of the ``black-box" nature of deep learning in medical settings, we combine GCExplainer - an automated concept discovery solution - along with Logic Explained Networks to provide global explanations for Graph Neural Networks.

Paper
Add Code

Panoptic Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result