Depth Estimation

799 papers with code • 14 benchmarks • 70 datasets

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Benchmarks

Add a Result

These leaderboards are used to track progress in Depth Estimation

Dataset	Best Model	Compare
Stanford2D3D Panoramic	HiMODE	See all
NYU-Depth V2	EVP	See all
eBDtheque	Bhattacharjee et al.	See all
DCM	Bhattacharjee et al.	See all
ScanNet	Atlas (plain)	See all
ScanNetV2	DELTAS	See all
DIODE	AIP-Brown	See all
Mars DTM Estimation	GLPDepth	See all
KITTI 2015	H-Net (Ours) Full Eigen	See all
4D Light Field Dataset	LFattNet	See all
Taskonomy	X-TC (Cross-Task Consistency)	See all
Cityscapes test	SDC-Depth	See all
Matterport3D	UniFuse	See all
KITTI Eigen split	LightDepth	See all

Show all 14 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Depth Estimation models and implementations

huggingface/transformers

3 papers

124,984

open-mmlab/mmdetection3d

2 papers

4,808

google-research/big_vision

2 papers

1,554

Datasets

Subtasks

3D Depth Estimation

Depth Map Super-Resolution

Stereo-LiDAR Fusion

Indoor Monocular Depth Estimation

Depth Aleatoric Uncertainty Estimation

Depth Image Upsampling

Latest papers with no code

Most implemented Social Latest No code

SGFormer: Spherical Geometry Transformer for 360 Depth Estimation

no code yet • 23 Apr 2024

Panoramic distortion poses a significant challenge in 360 depth estimation, particularly pronounced at the north and south poles.

Paper
Add Code

Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation

no code yet • 23 Apr 2024

In the next stage, we use an object network to estimate the depth of those moving objects assuming rigid motions.

Paper
Add Code

Self-Supervised Monocular Depth Estimation in the Dark: Towards Data Distribution Compensation

no code yet • 22 Apr 2024

In this paper, we propose a self-supervised nighttime monocular depth estimation method that does not use any night images during training.

Paper
Add Code

GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal

no code yet • 21 Apr 2024

This paper tackles the intricate challenge of object removal to update the radiance field using the 3D Gaussian Splatting.

Paper
Add Code

High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces

no code yet • 20 Apr 2024

In surgical oncology, screening colonoscopy plays a pivotal role in providing diagnostic assistance, such as biopsy, and facilitating surgical navigation, particularly in polyp detection.

Paper
Add Code

BLINK: Multimodal Large Language Models Can See but Not Perceive

no code yet • 18 Apr 2024

We introduce Blink, a new benchmark for multimodal language models (LLMs) that focuses on core visual perception abilities not found in other evaluations.

Paper
Add Code

SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation

no code yet • 18 Apr 2024

Our approach represents a significant leap forward in self-supervised monocular depth estimation, underscoring the importance of strengthening pose information for advancing scene understanding in real-world applications.

Paper
Add Code

How to deal with glare for improved perception of Autonomous Vehicles

no code yet • 17 Apr 2024

In this paper, we investigate various glare reduction techniques, including the proposed saturated pixel-aware glare reduction technique for improved performance of the computer vision (CV) tasks employed by the perception layer of AVs.

Paper
Add Code

Digging into contrastive learning for robust depth estimation with diffusion models

no code yet • 15 Apr 2024

In this paper, we propose a novel robust depth estimation method called D4RD, featuring a custom contrastive learning mode tailored for diffusion models to mitigate performance degradation in complex environments.

Paper
Add Code

In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition

no code yet • 14 Apr 2024

Our study aims to fill this research gap by exploring the field of 2D hand pose estimation for egocentric action recognition, making two contributions.

Paper
Add Code

Depth Estimation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result