Panoptic Segmentation

213 papers with code • 24 benchmarks • 32 datasets

Panoptic Segmentation is a computer vision task that combines semantic segmentation and instance segmentation to provide a comprehensive understanding of the scene. The goal of panoptic segmentation is to segment the image into semantically meaningful parts or regions, while also detecting and distinguishing individual instances of objects within those regions. In a given image, every pixel is assigned a semantic label, and pixels belonging to "things" classes (countable objects with instances, like cars and people) are assigned unique instance IDs. ( Image credit: Detectron2 )

Libraries

Use these libraries to find Panoptic Segmentation models and implementations

Latest papers with no code

The revenge of BiSeNet: Efficient Multi-Task Image Segmentation

no code yet • 15 Apr 2024

Recent advancements in image segmentation have focused on enhancing the efficiency of the models to meet the demands of real-time applications, especially on edge devices.

kNN-CLIP: Retrieval Enables Training-Free Segmentation on Continually Expanding Large Vocabularies

no code yet • 15 Apr 2024

Rapid advancements in continual segmentation have yet to bridge the gap of scaling to large continually expanding vocabularies under compute-constrained scenarios.

COCONut: Modernizing COCO Segmentation

no code yet • 12 Apr 2024

By enhancing the annotation quality and expanding the dataset to encompass 383K images with more than 5. 18M panoptic masks, we introduce COCONut, the COCO Next Universal segmenTation dataset.

Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation

no code yet • 4 Apr 2024

A key challenge in panoptic UDA is reducing the domain gap between a labeled source and an unlabeled target domain while harmonizing the subtasks of semantic and instance segmentation to limit catastrophic interference.

JRDB-PanoTrack: An Open-world Panoptic Segmentation and Tracking Robotic Dataset in Crowded Human Environments

no code yet • 2 Apr 2024

JRDB-PanoTrack includes (1) various data involving indoor and outdoor crowded scenes, as well as comprehensive 2D and 3D synchronized data modalities; (2) high-quality 2D spatial panoptic segmentation and temporal tracking annotations, with additional 3D label projections for further spatial understanding; (3) diverse object classes for closed- and open-world recognition benchmarks, with OSPA-based metrics for evaluation.

Using Images as Covariates: Measuring Curb Appeal with Deep Learning

no code yet • 29 Mar 2024

Motivated by forecasting sales prices for residential real estate, we harness the power of deep learning to add "information" contained in images as covariates.

Better Call SAL: Towards Learning to Segment Anything in Lidar

no code yet • 19 Mar 2024

We propose $\texttt{SAL}$ ($\texttt{S}$egment $\texttt{A}$nything in $\texttt{L}$idar) method consisting of a text-promptable zero-shot model for segmenting and classifying any object in Lidar, and a pseudo-labeling engine that facilitates model training without manual supervision.

Small, Versatile and Mighty: A Range-View Perception Framework

no code yet • 1 Mar 2024

Our proposed Small, Versatile, and Mighty (SVM) network utilizes a pure convolutional architecture to fully unleash the efficiency and multi-tasking potentials of the range view representation.

Benchmarking the Robustness of Panoptic Segmentation for Automated Driving

no code yet • 23 Feb 2024

Motivated by such a need, this work proposes a unifying pipeline to assess the robustness of panoptic segmentation models for AAD, correlating it with traditional image quality.

Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation

no code yet • 21 Feb 2024

However, one of the gaps in synthesizing pseudo vision queries, ie, vision queries for unseen categories, is describing fine-grained visual details through semantic embeddings.