TrickVOS: A Bag of Tricks for Video Object Segmentation

Space-time memory (STM) network methods have been dominant in semi-supervised video object segmentation (SVOS) due to their remarkable performance. In this work, we identify three key aspects where we can improve such methods: i) supervisory signal, ii) pretraining and iii) spatial awareness. We then propose TrickVOS, a generic, method-agnostic bag of tricks addressing each aspect with i) a structure-aware hybrid loss, ii) a simple decoder pretraining regime and iii) a cheap tracker that imposes spatial constraints on model predictions. Finally, we propose a lightweight network and show that, when trained with TrickVOS, it achieves results competitive with state-of-the-art methods on the DAVIS and YouTube benchmarks, while being one of the first STM-based SVOS methods that can run in real time on a mobile device.
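To make the idea of a spatial constraint concrete, the following is a minimal sketch of one way a prediction could be restricted to a region near the object's last known location. It is an assumption rather than the paper's tracker: the abstract does not describe the tracker's design, so the padded bounding box, the function name `constrain_to_box` and the `pad` parameter are all hypothetical. Plain nested lists are used to keep the example dependency-free; a practical version would use array operations.

```python
# Hypothetical illustration only: the abstract does not specify how the
# tracker works. Here we ASSUME a padded bounding box taken from the
# previous frame's mask and suppress predictions that fall outside it.

def constrain_to_box(prob, prev_mask, pad=1):
    """Zero out probabilities outside the padded bounding box of prev_mask.

    prob      -- 2D list of floats, current-frame foreground probabilities
    prev_mask -- 2D list of 0/1 ints, previous-frame segmentation mask
    pad       -- margin in pixels added around the box (assumed parameter)
    """
    height, width = len(prev_mask), len(prev_mask[0])
    # Rows and columns that contain at least one foreground pixel.
    rows = [r for r in range(height) if any(prev_mask[r])]
    cols = [c for c in range(width)
            if any(prev_mask[r][c] for r in range(height))]
    if not rows:
        # No object in the previous frame: return an unmodified copy.
        return [row[:] for row in prob]
    top, bottom = max(rows[0] - pad, 0), min(rows[-1] + pad, height - 1)
    left, right = max(cols[0] - pad, 0), min(cols[-1] + pad, width - 1)
    return [
        [prob[r][c] if top <= r <= bottom and left <= c <= right else 0.0
         for c in range(width)]
        for r in range(height)
    ]


# Usage: a single foreground pixel at (2, 2) in a 5x5 frame gives a 3x3 box.
prev = [[0] * 5 for _ in range(5)]
prev[2][2] = 1
current = [[0.9] * 5 for _ in range(5)]
constrained = constrain_to_box(current, prev, pad=1)
```

In this sketch, confident predictions far from the previous mask are discarded, which is one plausible reading of "spatial awareness"; the actual mechanism in TrickVOS may differ.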

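| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Semi-Supervised Video Object Segmentation | DAVIS 2016 | STCN + TrickVOS (PT) | Jaccard (Mean) | 90.5 | #13 |
| Semi-Supervised Video Object Segmentation | DAVIS 2016 | STCN + TrickVOS (PT) | F-measure (Mean) | 93.1 | #17 |
| Semi-Supervised Video Object Segmentation | DAVIS 2016 | STCN + TrickVOS (PT) | J&F | 91.8 | #16 |
| Semi-Supervised Video Object Segmentation | DAVIS 2016 | Lightweight TrickVOS (PT) | Jaccard (Mean) | 88.7 | #31 |
| Semi-Supervised Video Object Segmentation | DAVIS 2016 | Lightweight TrickVOS (PT) | F-measure (Mean) | 89.9 | #37 |
| Semi-Supervised Video Object Segmentation | DAVIS 2016 | Lightweight TrickVOS (PT) | J&F | 89.3 | #36 |
| Semi-Supervised Video Object Segmentation | DAVIS 2016 | Lightweight TrickVOS (PT) | Speed (FPS) | 86.4 | #3 |
| Semi-Supervised Video Object Segmentation | DAVIS-2016 | STCN + TrickVOS (PT) | Speed (FPS) | 45.4 | #1 |
| Semi-Supervised Video Object Segmentation | DAVIS-2017 | STCN + TrickVOS (PT) | J&F | 86.1 | #1 |
| Semi-Supervised Video Object Segmentation | DAVIS-2017 | STCN + TrickVOS (PT) | Speed (FPS) | 35.1 | #2 |
| Semi-Supervised Video Object Segmentation | DAVIS-2017 | STCN + TrickVOS (PT) | Jaccard (Mean) | 82.6 | #1 |
| Semi-Supervised Video Object Segmentation | DAVIS-2017 | STCN + TrickVOS (PT) | F-measure (Mean) | 89.6 | #1 |
| Semi-Supervised Video Object Segmentation | DAVIS-2017 | Lightweight TrickVOS (PT) | J&F | 82.7 | #2 |
| Semi-Supervised Video Object Segmentation | DAVIS-2017 | Lightweight TrickVOS (PT) | Speed (FPS) | 76.4 | #1 |
| Semi-Supervised Video Object Segmentation | DAVIS-2017 | Lightweight TrickVOS (PT) | Jaccard (Mean) | 79.4 | #2 |
| Semi-Supervised Video Object Segmentation | DAVIS-2017 | Lightweight TrickVOS (PT) | F-measure (Mean) | 86 | #2 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | STCN + TrickVOS (PT) | Jaccard (Seen) | 82.1 | #19 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | STCN + TrickVOS (PT) | Jaccard (Unseen) | 77.2 | #20 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | STCN + TrickVOS (PT) | F-Measure (Seen) | 86.4 | #19 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | STCN + TrickVOS (PT) | F-Measure (Unseen) | 85.5 | #19 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | STCN + TrickVOS (PT) | J&F | 82.8 | #1 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | Lightweight TrickVOS (PT) | Jaccard (Seen) | 79.5 | #22 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | Lightweight TrickVOS (PT) | F-Measure (Seen) | 83.3 | #22 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | Lightweight TrickVOS (PT) | F-Measure (Unseen) | 84 | #21 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | Lightweight TrickVOS (PT) | J&F | 80.5 | #2 |
| Semi-Supervised Video Object Segmentation | YouTube-VOS 2019 | Lightweight TrickVOS (PT) | J score (unseen) | 75.2 | #1 |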
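
The J&F rows above are consistent with the usual definition of the metric as the mean of the Jaccard (J) and F-measure (F) scores. The short sketch below checks this against the reported numbers. The helper name `j_and_f` is introduced here for illustration only, and for YouTube-VOS 2019 the sketch assumes the overall score averages the four seen and unseen values.

```python
def j_and_f(*scores):
    """Return the mean of the given J and F scores, rounded to one decimal."""
    return round(sum(scores) / len(scores), 1)


# DAVIS 2016, STCN + TrickVOS (PT): J = 90.5, F = 93.1
davis16 = j_and_f(90.5, 93.1)               # 91.8, matches the J&F row

# DAVIS-2017, STCN + TrickVOS (PT): J = 82.6, F = 89.6
davis17 = j_and_f(82.6, 89.6)               # 86.1, matches the J&F row

# YouTube-VOS 2019, STCN + TrickVOS (PT):
# J (seen), J (unseen), F (seen), F (unseen)
ytvos19 = j_and_f(82.1, 77.2, 86.4, 85.5)   # 82.8, matches the J&F row

print(davis16, davis17, ytvos19)
```

The same check reproduces the Lightweight TrickVOS J&F values on DAVIS 2016 (89.3) and DAVIS-2017 (82.7) from their J and F rows.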
