RGB-D Salient Object Detection

56 papers with code • 8 benchmarks • 5 datasets

RGB-D Salient object detection (SOD) aims at distinguishing the most visually distinctive objects or regions in a scene from the given RGB and Depth data. It has a wide range of applications, including video/image segmentation, object recognition, visual tracking, foreground maps evaluation, image retrieval, content-aware image editing, information discovery, photosynthesis, and weakly supervised semantic segmentation. Here, depth information plays an important complementary role in finding salient objects. Online benchmark: http://dpfan.net/d3netbenchmark.

( Image credit: Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks, TNNLS20 )

Benchmarks

Add a Result

These leaderboards are used to track progress in RGB-D Salient Object Detection

Dataset	Best Model	Compare
NJU2K	DFormer-L	See all
SIP	DFormer-L	See all
NLPR	DFormer-L	See all
STERE	DFormer-L	See all
DES	DFormer-L	See all
LFSD	UCNet-CVAE	See all
RGBD135	DASNet	See all
NJUD	VST	See all

Libraries

Use these libraries to find RGB-D Salient Object Detection models and implementations

taozh2017/RGBD-SODsurvey

5 papers

319

kerenfu/JLDCF

3 papers

Datasets

Latest papers with no code

Most implemented Social Latest No code

Multi-Scale Iterative Refinement Network for RGB-D Salient Object Detection

no code yet • 24 Jan 2022

The extensive research leveraging RGB-D information has been exploited in salient object detection.

Paper
Add Code

Learning Generative Vision Transformer with Energy-Based Latent Space for Saliency Prediction

no code yet • NeurIPS 2021

In this paper, we take a step further by proposing a novel generative vision transformer with latent variables following an informative energy-based prior for salient object detection.

Paper
Add Code

MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion Attention

no code yet • 2 Dec 2021

In this work, we re-think Transformer and extend it to MutualFormer for multi-modality data representation.

Paper
Add Code

M2RNet: Multi-modal and Multi-scale Refined Network for RGB-D Salient Object Detection

no code yet • 16 Sep 2021

The adjacent interactive aggregation module (AIAM) gradually integrates the neighbor features of high, middle and low levels.

Paper
Add Code

ACFNet: Adaptively-Cooperative Fusion Network for RGB-D Salient Object Detection

no code yet • 10 Sep 2021

Further, we proposed a type-based attention module (TAM) to optimize the network and enhance the multi-scale perception of different objects.

Paper
Add Code

RGB-D Salient Object Detection with Ubiquitous Target Awareness

no code yet • 8 Sep 2021

To construct our framework as well as achieving accurate salient detection results, we propose a Ubiquitous Target Awareness (UTA) network to solve three important challenges in RGB-D SOD task: 1) a depth awareness module to excavate depth information and to mine ambiguous regions via adaptive depth-error weights, 2) a spatial-aware cross-modal interaction and a channel-aware cross-level interaction, exploiting the low-level boundary cues and amplifying high-level salient channels, and 3) a gated multi-scale predictor module to perceive the object saliency in different contextual scales.

Paper
Add Code

Modal-Adaptive Gated Recoding Network for RGB-D Salient Object Detection

no code yet • 13 Aug 2021

Then, a modal-adaptive gate unit (MGU) is proposed to suppress the invalid information and transfer the effective modal features to the recoding mixer and the hybrid branch decoder.

Paper
Add Code

Dynamic Knowledge Distillation With Noise Elimination for RGB-D Salient Object Detection

no code yet • 17 Jun 2021

RGB-D salient object detection (SOD) demonstrates its superiority on detecting in complex environments due to the additional depth information introduced in the data.

Paper
Add Code

Progressive Multi-scale Fusion Network for RGB-D Salient Object Detection

no code yet • 7 Jun 2021

We further introduce a mask-guided refinement module(MGRM) to complement the high-level semantic features and reduce the irrelevant features from multi-scale fusion, leading to an overall refinement of detection.

Paper
Add Code

Middle-level Fusion for Lightweight RGB-D Salient Object Detection

no code yet • 23 Apr 2021

The former one first uses two sub-networks to extract unimodal features from RGB and depth images, respectively, and then fuses them for SOD.

Paper
Add Code

RGB-D Salient Object Detection

Benchmarks Add a Result

Libraries

Datasets

Latest papers with no code

Content

Benchmarks

Add a Result