Depth Estimation

799 papers with code • 14 benchmarks • 70 datasets

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Benchmarks

Add a Result

These leaderboards are used to track progress in Depth Estimation

Dataset	Best Model	Compare
Stanford2D3D Panoramic	HiMODE	See all
NYU-Depth V2	EVP	See all
eBDtheque	Bhattacharjee et al.	See all
DCM	Bhattacharjee et al.	See all
ScanNet	Atlas (plain)	See all
ScanNetV2	DELTAS	See all
DIODE	AIP-Brown	See all
Mars DTM Estimation	GLPDepth	See all
KITTI 2015	H-Net (Ours) Full Eigen	See all
4D Light Field Dataset	LFattNet	See all
Taskonomy	X-TC (Cross-Task Consistency)	See all
Cityscapes test	SDC-Depth	See all
Matterport3D	UniFuse	See all
KITTI Eigen split	LightDepth	See all

Show all 14 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Depth Estimation models and implementations

huggingface/transformers

3 papers

125,059

open-mmlab/mmdetection3d

2 papers

4,810

google-research/big_vision

2 papers

1,554

Datasets

Subtasks

3D Depth Estimation

Depth Map Super-Resolution

Stereo-LiDAR Fusion

Indoor Monocular Depth Estimation

Depth Aleatoric Uncertainty Estimation

Depth Image Upsampling

Latest papers

Most implemented Social Latest No code

UniDepth: Universal Monocular Metric Depth Estimation

lpiccinelli-eth/unidepth • • 27 Mar 2024

However, the remarkable accuracy of recent MMDE methods is confined to their training domains.

324

27 Mar 2024

Paper
Code

ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation

aradhye2002/ecodepth • • 27 Mar 2024

We argue that the embedding vector from a ViT model, pre-trained on a large dataset, captures greater relevant information for SIDE than the usual route of generating pseudo image captions, followed by CLIP based text embeddings.

27 Mar 2024

Paper
Code

ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition

haomo-ai/modalink • • 27 Mar 2024

Experimental results on the KITTI dataset show that our proposed methods achieve state-of-the-art performance while running in real time.

27 Mar 2024

Paper
Code

DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing

maturk/dn-splatter • • 26 Mar 2024

3D Gaussian splatting, a novel differentiable rendering technique, has achieved state-of-the-art novel view synthesis results with high rendering speeds and relatively low training times.

194

26 Mar 2024

Paper
Code

Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving

gandolfczjh/3d2fool • • 26 Mar 2024

Deep learning-based monocular depth estimation (MDE), extensively applied in autonomous driving, is known to be vulnerable to adversarial attacks.

26 Mar 2024

Paper
Code

When Do We Not Need Larger Vision Models?

bfshi/scaling_on_scales • • 19 Mar 2024

Our results show that a multi-scale smaller model has comparable learning capacity to a larger model, and pre-training smaller models with S$^2$ can match or even exceed the advantage of larger models.

184

19 Mar 2024

Paper
Code

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

mhamilton723/FeatUp • • 15 Mar 2024

Deep features are a cornerstone of computer vision research, capturing image semantics and enabling the community to solve downstream tasks even in the zero- or few-shot regime.

1,037

15 Mar 2024

Paper
Code

Robust Shape Fitting for 3D Scene Abstraction

fkluger/cuboids_revisited • • 15 Mar 2024

A RANSAC estimator guided by a neural network fits these primitives to a depth map.

15 Mar 2024

Paper
Code

SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images

pardistaghavi/swinmtl • • 15 Mar 2024

This research paper presents an innovative multi-task learning framework that allows concurrent depth estimation and semantic segmentation using a single camera.

15 Mar 2024

Paper
Code

SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model

1hao-liu/sm4depth • • 13 Mar 2024

Third, to reduce the reliance on massive training data, we propose a ``divide and conquer" solution.

13 Mar 2024

Paper
Code

Depth Estimation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers

Content

Benchmarks

Add a Result