Monocular Depth Estimation


Monocular depth estimation is the task of estimating the depth of objects in a scene from a single image.

WEDepth: Efficient Adaptation of World Knowledge for Monocular Depth Estimation

Nov 11, 2025

GeoSurDepth: Spatial Geometry-Consistent Self-Supervised Depth Estimation for Surround-View Cameras

Jan 09, 2026

An Empirical Study of Monocular Human Body Measurement Under Weak Calibration

Jan 04, 2026

BoRe-Depth: Self-supervised Monocular Depth Estimation with Boundary Refinement for Embedded Systems

Nov 06, 2025

EGSA-PT: Edge-Guided Spatial Attention with Progressive Training for Monocular Depth Estimation and Segmentation of Transparent Objects

Nov 18, 2025

No Pose Estimation? No Problem: Pose-Agnostic and Instance-Aware Test-Time Adaptation for Monocular Depth Estimation

Nov 07, 2025

How to Evaluate Monocular Depth Estimation?

Oct 22, 2025

StereoVLA: Enhancing Vision-Language-Action Models with Stereo Vision

Dec 26, 2025

FoundationSLAM: Unleashing the Power of Depth Foundation Models for End-to-End Dense Visual SLAM

Dec 31, 2025

With Great Context Comes Great Prediction Power: Classifying Objects via Geo-Semantic Scene Graphs

Dec 28, 2025