Scene Segmentation
120 papers with code • 5 benchmarks • 7 datasets
Scene segmentation is the task of splitting a scene into its various object components.
Image adapted from Temporally coherent 4D reconstruction of complex dynamic scenes.
Libraries
Use these libraries to find Scene Segmentation models and implementationsLatest papers
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation
To reduce the reliance on large-scale datasets, recent works in 3D segmentation resort to few-shot learning.
GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction
High Dynamic Range (HDR) content (i. e., images and videos) has a broad range of applications.
One model to use them all: Training a segmentation model with complementary datasets
In this work, we propose a method to combine multiple partially annotated datasets, which provide complementary annotations, into one model, enabling better scene segmentation and the use of multiple readily available datasets.
Learning Generalized Segmentation for Foggy-scenes by Bi-directional Wavelet Guidance
We argue that an ideal segmentation model that can be well generalized to foggy-scenes need to simultaneously enhance the content, de-correlate the urban-scene style and de-correlate the fog style.
Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review
This review thoroughly examines the role of semantically-aware Neural Radiance Fields (NeRFs) in visual scene understanding, covering an analysis of over 250 scholarly papers.
Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation
Integrating information from multiple modalities enhances the robustness of scene perception systems in autonomous vehicles, providing a more comprehensive and reliable sensory framework.
The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark
This technical report provides a detailed overview of Endoscapes, a dataset of laparoscopic cholecystectomy (LC) videos with highly intricate annotations targeted at automated assessment of the Critical View of Safety (CVS).
SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentation
We introduce SAMPro3D for zero-shot 3D indoor scene segmentation.
CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision
These new local distortions are generated by considering the scene context of the images that guarantees a high level of photo-realism.
GNeSF: Generalizable Neural Semantic Fields
We propose a novel soft voting mechanism to aggregate the 2D semantic information from different views for each 3D point.