Picture for Hao Zhang

Hao Zhang

refer to the report for detailed contributions

Hunyuan3D 2.1: From Images to High-Fidelity 3D Assets with Production-Ready PBR Material

Add code
Jun 18, 2025
Viaarxiv icon

Break Stylistic Sophon: Are We Really Meant to Confine the Imagination in Style Transfer?

Add code
Jun 18, 2025
Viaarxiv icon

Non-Overlap-Aware Egocentric Pose Estimation for Collaborative Perception in Connected Autonomy

Add code
Jun 17, 2025
Viaarxiv icon

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Add code
Jun 16, 2025
Viaarxiv icon

Leveraging Reference Documents for Zero-Shot Ranking via Large Language Models

Add code
Jun 13, 2025
Viaarxiv icon

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Add code
Jun 11, 2025
Viaarxiv icon

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Add code
Jun 08, 2025
Figure 1 for Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Figure 2 for Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Figure 3 for Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Figure 4 for Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning
Viaarxiv icon

Score-based Generative Modeling for Conditional Independence Testing

Add code
May 29, 2025
Viaarxiv icon

MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on

Add code
May 28, 2025
Viaarxiv icon

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Add code
May 28, 2025
Viaarxiv icon