Instance Segmentation

972 papers with code • 25 benchmarks • 83 datasets

Instance Segmentation is a computer vision task that involves identifying and separating individual objects within an image, including detecting the boundaries of each object and assigning a unique label to each object. The goal of instance segmentation is to produce a pixel-wise segmentation map of the image, where each pixel is assigned to a specific object instance.

Image Credit: Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers, CVPR'21

Benchmarks

Add a Result

These leaderboards are used to track progress in Instance Segmentation

Dataset	Best Model	Compare
COCO test-dev	EVA	See all
COCO minival	InternImage-H	See all
LVIS v1.0 val	Co-DETR (single-scale)	See all
Cityscapes val	OneFormer (InternImage-H, emb_dim=256, single-scale)	See all
ADE20K val	OneFormer (InternImage-H, emb_dim=1024, single-scale, 896x896, COCO-Pretrained)	See all
Cityscapes test	Deep Watershed Transform	See all
Occluded COCO	Swin-B + Cascade Mask R-CNN (tri-layer modelling)	See all
Separated COCO	Swin-B + Cascade Mask R-CNN (tri-layer modelling)	See all
iSAID	PANet++	See all
TBBR	Swin-T (ImageNet-1k pretrain)	See all
COCO 2017 val	SparK (ConvNeXt V1-B Mask R-CNN)	See all
BDD100K val	Mask Transfiner	See all
COCO val (panoptic labels)	OneFormer (InternImage-H, emb_dim=1024, single-scale)	See all
UIIS	WaterMask RCNN	See all
NYU Depth v2	SGPN-CNN	See all
KINS	BCNet	See all
nuScenes	TraDeS	See all
coco minval	R3-CNN (ResNet-50-FPN, GC-Net)	See all
Leaf Segmentation Challenge	LeafMask	See all
iShape	ASIS(baseline)	See all
LVIS v1.0 test-dev	R50-FPN-MaskRCNN-TTA	See all
PartNet	Semantic Segmentation-Assisted Instance Feature Fusion	See all
COCO val2017	MogaNet-S (256x192)	See all
Object Detection on COCO minival	DaViT-T (Mask R-CNN, 36 epochs)	See all
LDD	R^3-CNN	See all

Show all 25 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Instance Segmentation models and implementations

open-mmlab/mmdetection

31 papers

27,866

PaddlePaddle/PaddleDetection

18 papers

12,094

rwightman/pytorch-image-models

17 papers

29,863

huggingface/transformers

6 papers

125,425

See all 20 libraries.

Datasets

Subtasks

Unsupervised Object Segmentation

Amodal Instance Segmentation

Box-supervised Instance Segmentation

Image-level Supervised Instance Segmentation

Unseen Object Instance Segmentation

3D Semantic Instance Segmentation

Open-World Instance Segmentation

Human Instance Segmentation

One-Shot Instance Segmentation

Semi-Supervised Person Instance Segmentation

Point-Supervised Instance Segmentation

Solar Cell Segmentation

Latest papers with no code

Most implemented Social Latest No code

Structured Model Pruning for Efficient Inference in Computational Pathology

no code yet • 12 Apr 2024

In this work, we demonstrate that model pruning, as a model compression technique, can effectively reduce inference cost for computational and digital pathology based analysis with a negligible loss of analysis performance.

Paper
Add Code

Automated National Urban Map Extraction

no code yet • 9 Apr 2024

Developing countries usually lack the proper governance means to generate and regularly update a national rooftop map.

Paper
Add Code

Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation

no code yet • 6 Apr 2024

Experimental results on FineGrip demonstrate the feasibility of the panoptic perception task and the beneficial effect of multi-task joint optimization on individual tasks.

Paper
Add Code

OW-VISCap: Open-World Video Instance Segmentation and Captioning

no code yet • 4 Apr 2024

To address these issues, we propose Open-World Video Instance Segmentation and Captioning (OW-VISCap), an approach to jointly segment, track, and caption previously seen or unseen objects in a video.

Paper
Add Code

CORP: A Multi-Modal Dataset for Campus-Oriented Roadside Perception Tasks

no code yet • 4 Apr 2024

Numerous roadside perception datasets have been introduced to propel advancements in autonomous driving and intelligent transportation systems research and development.

Paper
Add Code

Language-Guided Instance-Aware Domain-Adaptive Panoptic Segmentation

no code yet • 4 Apr 2024

A key challenge in panoptic UDA is reducing the domain gap between a labeled source and an unlabeled target domain while harmonizing the subtasks of semantic and instance segmentation to limit catastrophic interference.

Paper
Add Code

Segment Any 3D Object with Language

no code yet • 2 Apr 2024

In addition, to align the 3D segmentation model with various language instructions and enhance the mask quality, we introduce three types of multimodal associations as supervision.

Paper
Add Code

Teeth-SEG: An Efficient Instance Segmentation Framework for Orthodontic Treatment based on Anthropic Prior Knowledge

no code yet • 1 Apr 2024

Teeth localization, segmentation, and labeling in 2D images have great potential in modern dentistry to enhance dental diagnostics, treatment planning, and population-based studies on oral health.

Paper
Add Code

SUGAR: Pre-training 3D Visual Representations for Robotics

no code yet • 1 Apr 2024

SUGAR employs a versatile transformer-based model to jointly address five pre-training tasks, namely cross-modal knowledge distillation for semantic learning, masked point modeling to understand geometry structures, grasping pose synthesis for object affordance, 3D instance segmentation and referring expression grounding to analyze cluttered scenes.

Paper
Add Code

What is Point Supervision Worth in Video Instance Segmentation?

no code yet • 1 Apr 2024

Video instance segmentation (VIS) is a challenging vision task that aims to detect, segment, and track objects in videos.

Paper
Add Code

Instance Segmentation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result