Hao Li

Jack

Seeing Sound, Hearing Sight: Uncovering Modality Bias and Conflict of AI models in Sound Localization

May 16, 2025
Via arXiv

Unsupervised Radar Point Cloud Enhancement via Arbitrary LiDAR Guided Diffusion Prior

May 15, 2025
Via arXiv

GIFStream: 4D Gaussian-based Immersive Video with Feature Stream

May 12, 2025
Via arXiv

Step1X-3D: Towards High-Fidelity and Controllable Generation of Textured 3D Assets

May 12, 2025
Via arXiv

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

May 08, 2025
Via arXiv

SOAP: Style-Omniscient Animatable Portraits

May 08, 2025
Via arXiv

Convex Relaxation for Robust Vanishing Point Estimation in Manhattan World

May 07, 2025
Via arXiv

Optimization of Module Transferability in Single Image Super-Resolution: Universality Assessment and Cycle Residual Blocks

May 06, 2025
Via arXiv

A machine learning model for skillful climate system prediction

May 06, 2025
Via arXiv

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

May 01, 2025
Via arXiv