Panoptic Segmentation

214 papers with code • 24 benchmarks • 32 datasets

Panoptic Segmentation is a computer vision task that combines semantic segmentation and instance segmentation to provide a comprehensive understanding of the scene. The goal of panoptic segmentation is to segment the image into semantically meaningful parts or regions, while also detecting and distinguishing individual instances of objects within those regions. In a given image, every pixel is assigned a semantic label, and pixels belonging to "things" classes (countable objects with instances, like cars and people) are assigned unique instance IDs. ( Image credit: Detectron2 )

Benchmarks

Add a Result

These leaderboards are used to track progress in Panoptic Segmentation

Dataset	Best Model	Compare
COCO test-dev	Mask DINO (single scale)	See all
Cityscapes val	OneFormer (ConvNeXt-L, single-scale, 512x1024, Mapillary Vistas-pretrained)	See all
COCO minival	OneFormer (InternImage-H, emb_dim=1024, single-scale)	See all
ADE20K val	OneFormer (InternImage-H, emb_dim=256, single-scale, 896x896)	See all
Mapillary val	OneFormer (DiNAT-L, single-scale)	See all
Cityscapes test	OneFormer (ConvNeXt-L, single-scale, Mapillary Vistas-Pretrained)	See all
LaRS	Mask2Former (Swin-B)	See all
S3DIS Area5	SuperCluster	See all
KITTI Panoptic Segmentation	EfficientPS	See all
Indian Driving Dataset	EfficientPS	See all
ScanNetV2	OneFormer3D	See all
ScanNet	OneFormer3D	See all
PASTIS	Exchanger+Mask2Former	See all
SemanticKITTI	P3Former	See all
PanNuke	CellViT-SAM-H	See all
COCO panoptic	VAN-B6*	See all
NYU Depth v2	EMSANet	See all
SUN-RGBD	EMSANet	See all
Panoptic nuScenes val	PolarSeg-Panoptic	See all
Panoptic nuScenes test	(AF)2-S3Net + CenterPoint	See all
PASTIS-R	Early Fusion	See all
S3DIS	SuperCluster	See all
KITTI-360	SuperCluster	See all
DALES	SuperCluster	See all

Show all 24 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Panoptic Segmentation models and implementations

open-mmlab/mmdetection

9 papers

27,852

huggingface/transformers

7 papers

125,334

google-research/deeplab2

7 papers

989

PaddlePaddle/PaddleDetection

5 papers

12,086

See all 15 libraries.

Datasets

Subtasks

Video Panoptic Segmentation

Latest papers

Most implemented Social Latest No code

CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

wusize/clipself • • 2 Oct 2023

However, when transferring the vision-language alignment of CLIP from global image representation to local region representation for the open-vocabulary dense prediction tasks, CLIP ViTs suffer from the domain shift from full images to local image regions.

135

02 Oct 2023

Paper
Code

Mask4Former: Mask Transformer for 4D Panoptic Segmentation

YilmazKadir/Mask4Former • • 28 Sep 2023

With this intention, we propose Mask4Former for the challenging task of 4D panoptic segmentation of LiDAR point clouds.

28 Sep 2023

Paper
Code

Finite Scalar Quantization: VQ-VAE Made Simple

google-research/google-research • • 27 Sep 2023

Each dimension is quantized to a small set of fixed values, leading to an (implicit) codebook given by the product of these sets.

32,870

27 Sep 2023

Paper
Code

ClusterFormer: Clustering As A Universal Visual Learner

clusterformer/clusterformer • • 22 Sep 2023

This paper presents CLUSTERFORMER, a universal vision model that is based on the CLUSTERing paradigm with TransFORMER.

22 Sep 2023

Paper
Code

Few-Shot Panoptic Segmentation With Foundation Models

robot-learning-freiburg/SPINO • • 19 Sep 2023

Concurrently, recent breakthroughs in visual representation learning have sparked a paradigm shift leading to the advent of large foundation models that can be trained with completely unlabeled images.

19 Sep 2023

Paper
Code

UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase

pjlab-adg/pcseg • • ICCV 2023

Besides, we construct the OpenPCSeg codebase, which is the largest and most comprehensive outdoor LiDAR segmentation codebase.

296

11 Sep 2023

Paper
Code

Panoptic Vision-Language Feature Fields

ethz-asl/autolabel • • 11 Sep 2023

In this paper, we propose to the best of our knowledge the first algorithm for open-vocabulary panoptic segmentation in 3D scenes.

11 Sep 2023

Paper
Code

Tracking Anything with Decoupled Video Segmentation

hkchengrex/Tracking-Anything-with-DEVA • • ICCV 2023

To 'track anything' without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation.

1,068

07 Sep 2023

Paper
Code

Learning to Upsample by Learning to Sample

tiny-smart/dysample • • ICCV 2023

We present DySample, an ultra-lightweight and effective dynamic upsampler.

29 Aug 2023

Paper
Code

LaRS: A Diverse Panoptic Maritime Obstacle Detection Dataset and Benchmark

lojzezust/lars_evaluator • ICCV 2023

The progress in maritime obstacle detection is hindered by the lack of a diverse dataset that adequately captures the complexity of general maritime environments.

18 Aug 2023

Paper
Code

Panoptic Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result