Video Salient Object Detection

20 papers with code • 10 benchmarks • 4 datasets

Video salient object detection (VSOD) is significantly essential for understanding the underlying mechanism behind HVS during free-viewing in general and instrumental to a wide range of real-world applications, e.g., video segmentation, video captioning, video compression, autonomous driving, robotic interaction, weakly supervised attention. Besides its academic value and practical significance, VSOD presents great difficulties due to the challenges carried by video data (diverse motion patterns, occlusions, blur, large object deformations, etc.) and the inherent complexity of human visual attention behavior (i.e., selective attention allocation, attention shift) during dynamic scenes. Online benchmark: http://dpfan.net/davsod.

( Image credit: Shifting More Attention to Video Salient Object Detection, CVPR2019-Best Paper Finalist )

Depth Quality-Inspired Feature Manipulation for Efficient RGB-D and Video Salient Object Detection

zwbx/DFM-Net 8 Aug 2022

Inspired by the fact that depth quality is a key factor influencing the accuracy, we propose an efficient depth quality-inspired feature manipulation (DQFM) process, which can dynamically filter depth features according to depth quality.

46
08 Aug 2022

Motion-aware Memory Network for Fast Video Salient Object Detection

zhaoxing2022/mmn-vsod 1 Aug 2022

Moreover, inspired by the boundary supervision commonly used in image salient object detection (ISOD), we design a motion-aware loss for predicting object boundary motion and simultaneously perform multitask learning for VSOD and object motion prediction, which can further facilitate the model to extract spatiotemporal features accurately and maintain the object integrity.

2
01 Aug 2022

Hierarchical Feature Alignment Network for Unsupervised Video Object Segmentation

NUST-Machine-Intelligence-Laboratory/HFAN 18 Jul 2022

Optical flow is an easily conceived and precious cue for advancing unsupervised video object segmentation (UVOS).

28
18 Jul 2022

Learning Video Salient Object Detection Progressively from Unlabeled Videos

bradleybin/locate-globally-segment-locally-a-progressive-architecture-with-knowledge-review-network-for-sod 5 Apr 2022

Recent deep learning-based video salient object detection (VSOD) has achieved some breakthrough, but these methods rely on expensive annotated videos with pixel-wise annotations, weak annotations, or part of the pixel-wise annotations.

42
05 Apr 2022

A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection

suyukun666/UFO 9 Mar 2022

Besides, they fail to take full advantage of the cues among inter- and intra-feature within a group of images.

278
09 Mar 2022

Depth-Cooperated Trimodal Network for Video Salient Object Detection

kerenfu/rdvs 12 Feb 2022

However, existing video salient object detection (VSOD) methods only utilize spatiotemporal information and seldom exploit depth information for detection.

35
12 Feb 2022

Full-Duplex Strategy for Video Object Segmentation

GewelsJI/FSNet ICCV 2021

Previous video object segmentation approaches mainly focus on using simplex solutions between appearance and motion, limiting feature collaboration efficiency among and across these two cues.

65
06 Aug 2021

Video Salient Object Detection via Adaptive Local-Global Refinement

piggy2021/ALGRF 29 Apr 2021

Despite their simplicity, such fusion strategies may introduce feature redundancy, and also fail to fully exploit the relationship between multi-level features extracted from both spatial and temporal domains.

6
29 Apr 2021

Weakly Supervised Video Salient Object Detection

wangbo-zhao/WSVSOD CVPR 2021

Significant performance improvement has been achieved for fully-supervised video salient object detection with the pixel-wise labeled training datasets, which are time-consuming and expensive to obtain.

41
06 Apr 2021

Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection

oiplab-dut/dcfnet ICCV 2021

Our bidirectional dynamic fusion strategy encourages the interaction of spatial and temporal information in a dynamic manner.

8
01 Jan 2021