Bifurcated backbone strategy for RGB-D salient object detection

6 Jul 2020  ·  Yingjie Zhai, Deng-Ping Fan, Jufeng Yang, Ali Borji, Ling Shao, Junwei Han, Liang Wang ·

Multi-level feature fusion is a fundamental topic in computer vision. It has been exploited to detect, segment and classify objects at various scales. When multi-level features meet multi-modal cues, the optimal feature aggregation and multi-modal learning strategy become a hot potato. In this paper, we leverage the inherent multi-modal and multi-level nature of RGB-D salient object detection to devise a novel cascaded refinement network. In particular, first, we propose to regroup the multi-level features into teacher and student features using a bifurcated backbone strategy (BBS). Second, we introduce a depth-enhanced module (DEM) to excavate informative depth cues from the channel and spatial views. Then, RGB and depth modalities are fused in a complementary way. Our architecture, named Bifurcated Backbone Strategy Network (BBS-Net), is simple, efficient, and backbone-independent. Extensive experiments show that BBS-Net significantly outperforms eighteen SOTA models on eight challenging datasets under five evaluation measures, demonstrating the superiority of our approach ($\sim 4 \%$ improvement in S-measure $vs.$ the top-ranked model: DMRA-iccv2019). In addition, we provide a comprehensive analysis on the generalization ability of different RGB-D datasets and provide a powerful training set for future research.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
RGB-D Salient Object Detection DES BBS-Net S-Measure 93.3 # 8
Average MAE 0.021 # 7
max E-Measure 96.6 # 7
max F-Measure 92.7 # 6
RGB-D Salient Object Detection LFSD BBS-Net S-Measure 86.4 # 4
Average MAE 0.072 # 5
max E-Measure 90.1 # 2
max F-Measure 85.8 # 2
RGB-D Salient Object Detection NLPR BBS-Net S-Measure 93.0 # 2
Average MAE 0.023 # 5
max F-Measure 91.8 # 3
max E-Measure 96.1 # 5
RGB-D Salient Object Detection RGBD135 BBS-Net S-Measure 88.2 # 2
Average MAE 0.044 # 2
max F-Measure 85.9 # 2
max E-Measure 91.9 # 1
RGB-D Salient Object Detection SIP BBS-Net S-Measure 87.9 # 8
max E-Measure 92.2 # 8
max F-Measure 88.3 # 8
Average MAE 0.055 # 12
RGB-D Salient Object Detection STERE BBS-Net S-Measure 90.8 # 5
Average MAE 0.041 # 9
max F-Measure 90.3 # 5
max E-Measure 94.2 # 6

Methods


No methods listed for this paper. Add relevant methods here