Monocular Depth Estimation

339 papers with code • 17 benchmarks • 27 datasets

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for determining scene understanding for applications such as 3D scene reconstruction, autonomous driving, and AR. State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using RMSE or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data

Libraries

Use these libraries to find Monocular Depth Estimation models and implementations

Harnessing Diffusion Models for Visual Perception with Meta Prompts

fudan-zvg/meta-prompts 22 Dec 2023

Our key insight is to introduce learnable embeddings (meta prompts) to the pre-trained diffusion models to extract proper features for perception.

60
22 Dec 2023

Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion

zkawfanx/atlantis 19 Dec 2023

Nonetheless, the performance of these methods is often constrained by the domain gap and looser constraints.

30
19 Dec 2023

EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment

lavreniuk/evp 13 Dec 2023

Second, we propose a novel image-text alignment module for improved feature extraction of the Stable Diffusion backbone.

49
13 Dec 2023

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

prs-eth/marigold 4 Dec 2023

Monocular depth estimation is a fundamental computer vision task.

1,690
04 Dec 2023

Deeper into Self-Supervised Monocular Indoor Depth Estimation

fcntes/indoordepth 3 Dec 2023

One is the large areas of low-texture regions and the other is the complex ego-motion on indoor training datasets.

13
03 Dec 2023

SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

huang-yh/selfocc 21 Nov 2023

Our SelfOcc outperforms the previous best method SceneRF by 58. 7% using a single frame as input on SemanticKITTI and is the first self-supervised work that produces reasonable 3D occupancy for surround cameras on nuScenes.

224
21 Nov 2023

Camera-Independent Single Image Depth Estimation from Defocus Blur

sleekeagle/defocus_camind 21 Nov 2023

We created a synthetic dataset which can be used to test the camera independent performance of depth from defocus blur models.

3
21 Nov 2023

NDDepth: Normal-Distance Assisted Monocular Depth Estimation and Completion

ShuweiShao/NDDepth 13 Nov 2023

To this end, we develop a normal-distance head that outputs pixel-level surface normal and distance.

91
13 Nov 2023

MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model

shuweishao/monodiffusion 13 Nov 2023

Because the depth ground-truth is unavailable in the training phase, we develop a pseudo ground-truth diffusion process to assist the diffusion in MonoDiffusion.

25
13 Nov 2023

MonoProb: Self-Supervised Monocular Depth Estimation with Interpretable Uncertainty

cea-list/monoprob 10 Nov 2023

Self-supervised monocular depth estimation methods aim to be used in critical applications such as autonomous vehicles for environment analysis.

23
10 Nov 2023