Picture for Haodong Li

Haodong Li

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Add code
Feb 02, 2026
Viaarxiv icon

MPF-Net: Exposing High-Fidelity AI-Generated Video Forgeries via Hierarchical Manifold Deviation and Micro-Temporal Fluctuations

Add code
Jan 29, 2026
Viaarxiv icon

Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation

Add code
Dec 18, 2025
Figure 1 for Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation
Figure 2 for Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation
Figure 3 for Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation
Figure 4 for Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation
Viaarxiv icon

As If We've Met Before: LLMs Exhibit Certainty in Recognizing Seen Files

Add code
Nov 19, 2025
Viaarxiv icon

DA$^2$: Depth Anything in Any Direction

Add code
Sep 30, 2025
Viaarxiv icon

LCS-CTC: Leveraging Soft Alignments to Enhance Phonetic Transcription Robustness

Add code
Aug 05, 2025
Viaarxiv icon

StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation

Add code
Jun 25, 2025
Figure 1 for StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation
Figure 2 for StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation
Figure 3 for StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation
Figure 4 for StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation
Viaarxiv icon

Advancing high-fidelity 3D and Texture Generation with 2.5D latents

Add code
May 28, 2025
Viaarxiv icon

Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation

Add code
Mar 20, 2025
Figure 1 for Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation
Figure 2 for Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation
Figure 3 for Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation
Figure 4 for Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation
Viaarxiv icon

DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark

Add code
Nov 05, 2024
Viaarxiv icon