Depth Estimation

797 papers with code • 14 benchmarks • 70 datasets

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Benchmarks

Add a Result

These leaderboards are used to track progress in Depth Estimation

Dataset	Best Model	Compare
Stanford2D3D Panoramic	HiMODE	See all
NYU-Depth V2	EVP	See all
eBDtheque	Bhattacharjee et al.	See all
DCM	Bhattacharjee et al.	See all
ScanNet	Atlas (plain)	See all
ScanNetV2	DELTAS	See all
DIODE	AIP-Brown	See all
Mars DTM Estimation	GLPDepth	See all
KITTI 2015	H-Net (Ours) Full Eigen	See all
4D Light Field Dataset	LFattNet	See all
Taskonomy	X-TC (Cross-Task Consistency)	See all
Cityscapes test	SDC-Depth	See all
Matterport3D	UniFuse	See all
KITTI Eigen split	LightDepth	See all

Show all 14 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Depth Estimation models and implementations

huggingface/transformers

3 papers

124,889

open-mmlab/mmdetection3d

2 papers

4,808

google-research/big_vision

2 papers

1,552

Datasets

Subtasks

3D Depth Estimation

Depth Map Super-Resolution

Stereo-LiDAR Fusion

Indoor Monocular Depth Estimation

Depth Aleatoric Uncertainty Estimation

Depth Image Upsampling

Most implemented papers

Most implemented Social Latest No code

AdaBins: Depth Estimation using Adaptive Bins

shariqfarooq123/AdaBins • • CVPR 2021

We address the problem of estimating a high quality dense depth map from a single RGB input image.

Paper
Code

DINOv2: Learning Robust Visual Features without Supervision

facebookresearch/dinov2 • • 14 Apr 2023

The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision.

Paper
Code

Sparse-to-Dense: Depth Prediction from Sparse Depth Samples and a Single Image

fangchangma/sparse-to-dense • • 21 Sep 2017

We consider the problem of dense depth prediction from a sparse set of depth measurements and a single RGB image.

Paper
Code

Pyramid Stereo Matching Network

JiaRenChang/PSMNet • • CVPR 2018

The spatial pyramid pooling module takes advantage of the capacity of global context information by aggregating context in different scales and locations to form a cost volume.

Paper
Code

Multi-Task Learning as Multi-Objective Optimization

IntelVCL/MultiObjectiveOptimization • • NeurIPS 2018

These algorithms are not directly applicable to large-scale learning problems since they scale poorly with the dimensionality of the gradients and the number of tasks.

Paper
Code

Index Network

poppinace/indexnet_matting • • 11 Aug 2019

By viewing the indices as a function of the feature map, we introduce the concept of "learning to index", and present a novel index-guided encoder-decoder framework where indices are self-learned adaptively from data and are used to guide the downsampling and upsampling stages, without extra training supervision.

Paper
Code

EPP-MVSNet: Epipolar-Assembling Based Depth Prediction for Multi-View Stereo

mindspore-ai/models • • ICCV 2021

As a result, we achieve promising results on all datasets and the highest F-Score on the online TNT intermediate benchmark.

Paper
Code

Deep Depth From Focus

gameover27/ddff-pytorch • • 4 Apr 2017

Depth from focus (DFF) is one of the classical ill-posed inverse problems in computer vision.

Paper
Code

Deep Ordinal Regression Network for Monocular Depth Estimation

hufu6371/DORN • CVPR 2018

These methods model depth estimation as a regression problem and train the regression networks by minimizing mean squared error, which suffers from slow convergence and unsatisfactory local solutions.

Paper
Code

Unsupervised Monocular Depth Learning in Dynamic Scenes

google-research/google-research • • 30 Oct 2020

We present a method for jointly training the estimation of depth, ego-motion, and a dense 3D translation field of objects relative to the scene, with monocular photometric consistency being the sole source of supervision.

Paper
Code

Depth Estimation

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result