Panoptic Segmentation
213 papers with code • 24 benchmarks • 32 datasets
Panoptic Segmentation is a computer vision task that combines semantic segmentation and instance segmentation to provide a comprehensive understanding of a scene. The goal of panoptic segmentation is to partition the image into semantically meaningful regions while also detecting and distinguishing individual object instances within those regions. In a given image, every pixel is assigned a semantic label, and pixels belonging to "things" classes (countable objects with instances, such as cars and people) are additionally assigned unique instance IDs. (Image credit: Detectron2)
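As an illustration, the per-pixel output described above is often stored as a single label map that packs the semantic class and the instance ID into one integer. The sketch below assumes a common `class_id * 1000 + instance_id` encoding convention (as used, for example, in Cityscapes-style panoptic formats); the class IDs and offset are illustrative, not tied to any specific dataset.

```python
import numpy as np

# Assumed convention: panoptic ID = semantic class ID * OFFSET + instance ID.
# "Stuff" classes (e.g. road, sky) use instance ID 0; "things" get unique IDs.
OFFSET = 1000

def encode_panoptic(semantic, instance):
    """Combine per-pixel semantic class IDs and instance IDs into one map."""
    return semantic.astype(np.int64) * OFFSET + instance.astype(np.int64)

def decode_panoptic(panoptic):
    """Recover (semantic, instance) maps from an encoded panoptic map."""
    return panoptic // OFFSET, panoptic % OFFSET

# Toy 2x3 image: class 7 is a "thing" (e.g. car), class 21 is "stuff" (e.g. road).
semantic = np.array([[21, 21, 7],
                     [21, 7, 7]])
instance = np.array([[0, 0, 1],
                     [0, 2, 1]])  # two distinct car instances (IDs 1 and 2)

pan = encode_panoptic(semantic, instance)
sem, inst = decode_panoptic(pan)
assert (sem == semantic).all() and (inst == instance).all()
```

This round-trip property is what lets panoptic annotations be stored as a single image-sized array while still separating "stuff" regions from individual "thing" instances.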
Libraries
Use these libraries to find Panoptic Segmentation models and implementations.
Latest papers with no code
The revenge of BiSeNet: Efficient Multi-Task Image Segmentation
Recent advancements in image segmentation have focused on enhancing the efficiency of the models to meet the demands of real-time applications, especially on edge devices.
kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies
Rapid advancements in continual segmentation have yet to bridge the gap of scaling to large continually expanding vocabularies under compute-constrained scenarios.
COCONut: Modernizing COCO Segmentation
By enhancing the annotation quality and expanding the dataset to encompass 383K images with more than 5.18M panoptic masks, we introduce COCONut, the COCO Next Universal segmenTation dataset.
Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation
A key challenge in panoptic UDA is reducing the domain gap between a labeled source and an unlabeled target domain while harmonizing the subtasks of semantic and instance segmentation to limit catastrophic interference.
JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments
JRDB-PanoTrack includes (1) various data involving indoor and outdoor crowded scenes, as well as comprehensive 2D and 3D synchronized data modalities; (2) high-quality 2D spatial panoptic segmentation and temporal tracking annotations, with additional 3D label projections for further spatial understanding; (3) diverse object classes for closed- and open-world recognition benchmarks, with OSPA-based metrics for evaluation.
Using Images as Covariates: Measuring Curb Appeal with Deep Learning
Motivated by forecasting sales prices for residential real estate, we harness the power of deep learning to add "information" contained in images as covariates.
Better Call SAL: Towards Learning to Segment Anything in Lidar
We propose $\texttt{SAL}$ ($\texttt{S}$egment $\texttt{A}$nything in $\texttt{L}$idar) method consisting of a text-promptable zero-shot model for segmenting and classifying any object in Lidar, and a pseudo-labeling engine that facilitates model training without manual supervision.
Small, Versatile and Mighty: A Range-View Perception Framework
Our proposed Small, Versatile, and Mighty (SVM) network utilizes a pure convolutional architecture to fully unleash the efficiency and multi-tasking potentials of the range view representation.
Benchmarking the Robustness of Panoptic Segmentation for Automated Driving
Motivated by such a need, this work proposes a unifying pipeline to assess the robustness of panoptic segmentation models for automated driving, correlating it with traditional image quality.
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation
However, one of the gaps in synthesizing pseudo vision queries, i.e., vision queries for unseen categories, is describing fine-grained visual details through semantic embeddings.