Semantic Segmentation

5199 papers with code • 125 benchmarks • 311 datasets

Semantic Segmentation is a computer vision task in which the goal is to categorize each pixel in an image into a class or object. The goal is to produce a dense pixel-wise segmentation map of an image, where each pixel is assigned to a specific class or object. Some example benchmarks for this task are Cityscapes, PASCAL VOC and ADE20K. Models are usually evaluated with the Mean Intersection-Over-Union (Mean IoU) and Pixel Accuracy metrics.

( Image credit: CSAILVision )

Libraries

Use these libraries to find Semantic Segmentation models and implementations
53 papers
8,252
30 papers
2,917
See all 39 libraries.

Latest papers with no code

Efficient Transformer Encoders for Mask2Former-style models

no code yet • 23 Apr 2024

The third step is to use the aforementioned derived dataset to train a gating network that predicts the number of encoder layers to be used, conditioned on the input image.

Ultrasound SAM Adapter: Adapting SAM for Breast Lesion Segmentation in Ultrasound Images

no code yet • 23 Apr 2024

To address these issues, in this paper, we develop a novel Breast Ultrasound SAM Adapter, termed Breast Ultrasound Segment Anything Model (BUSSAM), which migrates the SAM to the field of breast ultrasound image segmentation by using the adapter technique.

CFPFormer: Feature-pyramid like Transformer Decoder for Segmentation and Detection

no code yet • 23 Apr 2024

Feature pyramids have been widely adopted in convolutional neural networks (CNNs) and transformers for tasks like medical image segmentation and object detection.

OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks

no code yet • 22 Apr 2024

Models pretrained with our method exhibit improved BEV semantic segmentation performance, particularly in low-data scenarios.

360VOTS: Visual Object Tracking and Segmentation in Omnidirectional Videos

no code yet • 22 Apr 2024

Visual object tracking and segmentation in omnidirectional videos are challenging due to the wide field-of-view and large spherical distortion brought by 360{\deg} images.

PM-VIS: High-Performance Box-Supervised Video Instance Segmentation

no code yet • 22 Apr 2024

Our PM-VIS model, trained with high-quality pseudo mask annotations, demonstrates strong ability in instance mask prediction, achieving state-of-the-art performance on the YouTube-VIS 2019, YouTube-VIS 2021, and OVIS validation sets, notably narrowing the gap between box-supervised and fully supervised VIS methods.

PV-S3: Advancing Automatic Photovoltaic Defect Detection using Semi-Supervised Semantic Segmentation of Electroluminescence Images

no code yet • 21 Apr 2024

Traditional manual health check, using Electroluminescence (EL) imaging, is expensive and logistically challenging making automated defect detection essential.

PEMMA: Parameter-Efficient Multi-Modal Adaptation for Medical Image Segmentation

no code yet • 21 Apr 2024

In this work, we propose a parameter-efficient multi-modal adaptation (PEMMA) framework for lightweight upgrading of a transformer-based segmentation model trained only on CT scans to also incorporate PET scans.

Semantic-Rearrangement-Based Multi-Level Alignment for Domain Generalized Segmentation

no code yet • 21 Apr 2024

SRMA first incorporates a Semantic Rearrangement Module (SRM), which conducts semantic region randomization to enhance the diversity of the source domain sufficiently.

A Complete System for Automated 3D Semantic-Geometric Mapping of Corrosion in Industrial Environments

no code yet • 21 Apr 2024

Corrosion, a naturally occurring process leading to the deterioration of metallic materials, demands diligent detection for quality control and the preservation of metal-based objects, especially within industrial contexts.