Video Segmentation

105 papers with code • 1 benchmarks • 9 datasets

This task has no description! Would you like to contribute one?

Latest papers with no code

SimLVSeg: Simplifying Left Ventricular Segmentation in 2D+Time Echocardiograms with Self- and Weakly-Supervised Learning

no code yet • 30 Sep 2023

From calculating biomarkers such as ejection fraction to the probability of a patient's heart failure, accurate segmentation of the heart structures allows doctors to assess the heart's condition and devise treatments with greater precision and accuracy.

SANPO: A Scene Understanding, Accessibility, Navigation, Pathfinding, Obstacle Avoidance Dataset

no code yet • 21 Sep 2023

All synthetic sessions and a subset of real sessions have temporally consistent dense panoptic segmentation labels.

MoDA: Leveraging Motion Priors from Videos for Advancing Unsupervised Domain Adaptation in Semantic Segmentation

no code yet • 21 Sep 2023

Then, we propose a semantic mining module that takes the object masks to refine the pseudo labels in the target domain.

MEGA: Multimodal Alignment Aggregation and Distillation For Cinematic Video Segmentation

no code yet • ICCV 2023

Previous research has studied the task of segmenting cinematic videos into scenes and into narrative acts.

Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation

no code yet • 14 Aug 2023

For control, a closed-loop system utilizing TCP for VR control and positioning of agricultural machinery was introduced.

Stochastic positional embeddings improve masked image modeling

no code yet • 31 Jul 2023

Masked Image Modeling (MIM) is a promising self-supervised learning approach that enables learning from unlabeled images.

Automatic Interaction and Activity Recognition from Videos of Human Manual Demonstrations with Application to Anomaly Detection

no code yet • 19 Apr 2023

This paper presents a new method to describe spatio-temporal relations between objects and hands, to recognize both interactions and activities within video demonstrations of manual tasks.

A Unified Multiscale Encoder-Decoder Transformer for Video Segmentation

no code yet • CVPR 2023

In this paper, we present an end-to-end trainable unified multiscale encoder-decoder transformer that is focused on dense prediction tasks in video.

A Threefold Review on Deep Semantic Segmentation: Efficiency-oriented, Temporal and Depth-aware design

no code yet • 8 Mar 2023

Semantic image and video segmentation stand among the most important tasks in computer vision nowadays, since they provide a complete and meaningful representation of the environment by means of a dense classification of the pixels in a given scene.

Learning to Adapt to Online Streams with Distribution Shifts

no code yet • 2 Mar 2023

Test-time adaptation (TTA) is a technique used to reduce distribution gaps between the training and testing sets by leveraging unlabeled test data during inference.