Depth Estimation

799 papers with code • 14 benchmarks • 70 datasets

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Libraries

Use these libraries to find Depth Estimation models and implementations

UniDepth: Universal Monocular Metric Depth Estimation

lpiccinelli-eth/unidepth 27 Mar 2024

However, the remarkable accuracy of recent MMDE methods is confined to their training domains.

324
27 Mar 2024

ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation

aradhye2002/ecodepth 27 Mar 2024

We argue that the embedding vector from a ViT model, pre-trained on a large dataset, captures greater relevant information for SIDE than the usual route of generating pseudo image captions, followed by CLIP based text embeddings.

98
27 Mar 2024

ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition

haomo-ai/modalink 27 Mar 2024

Experimental results on the KITTI dataset show that our proposed methods achieve state-of-the-art performance while running in real time.

6
27 Mar 2024

DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing

maturk/dn-splatter 26 Mar 2024

3D Gaussian splatting, a novel differentiable rendering technique, has achieved state-of-the-art novel view synthesis results with high rendering speeds and relatively low training times.

194
26 Mar 2024

Physical 3D Adversarial Attacks against Monocular Depth Estimation in Autonomous Driving

gandolfczjh/3d2fool 26 Mar 2024

Deep learning-based monocular depth estimation (MDE), extensively applied in autonomous driving, is known to be vulnerable to adversarial attacks.

7
26 Mar 2024

When Do We Not Need Larger Vision Models?

bfshi/scaling_on_scales 19 Mar 2024

Our results show that a multi-scale smaller model has comparable learning capacity to a larger model, and pre-training smaller models with S$^2$ can match or even exceed the advantage of larger models.

184
19 Mar 2024

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

mhamilton723/FeatUp 15 Mar 2024

Deep features are a cornerstone of computer vision research, capturing image semantics and enabling the community to solve downstream tasks even in the zero- or few-shot regime.

1,037
15 Mar 2024

Robust Shape Fitting for 3D Scene Abstraction

fkluger/cuboids_revisited 15 Mar 2024

A RANSAC estimator guided by a neural network fits these primitives to a depth map.

35
15 Mar 2024

SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images

pardistaghavi/swinmtl 15 Mar 2024

This research paper presents an innovative multi-task learning framework that allows concurrent depth estimation and semantic segmentation using a single camera.

5
15 Mar 2024

SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model

1hao-liu/sm4depth 13 Mar 2024

Third, to reduce the reliance on massive training data, we propose a ``divide and conquer" solution.

28
13 Mar 2024