Depth To 3D


FMPose3D: monocular 3D pose estimation via flow matching

Feb 05, 2026

NeVStereo: A NeRF-Driven NVS-Stereo Architecture for High-Fidelity 3D Tasks

Feb 05, 2026

TrajVG: 3D Trajectory-Coupled Visual Geometry Learning

Feb 05, 2026

Splat and Distill: Augmenting Teachers with Feed-Forward 3D Reconstruction For 3D-Aware Distillation

Feb 05, 2026

PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction

Feb 05, 2026

DRMOT: A Dataset and Framework for RGBD Referring Multi-Object Tracking

Feb 04, 2026

A$^2$-LLM: An End-to-end Conversational Audio Avatar Large Language Model

Feb 04, 2026

SpatiaLab: Can Vision-Language Models Perform Spatial Reasoning in the Wild?

Feb 03, 2026

3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation

Feb 03, 2026

Seeing Through Clutter: Structured 3D Scene Reconstruction via Iterative Object Removal

Feb 03, 2026