Picture for Hao Zhang

Hao Zhang

refer to the report for detailed contributions

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

Add code
Jun 08, 2025
Viaarxiv icon

Score-based Generative Modeling for Conditional Independence Testing

Add code
May 29, 2025
Viaarxiv icon

MagicTryOn: Harnessing Diffusion Transformer for Garment-Preserving Video Virtual Try-on

Add code
May 28, 2025
Viaarxiv icon

Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding

Add code
May 28, 2025
Viaarxiv icon

MoPFormer: Motion-Primitive Transformer for Wearable-Sensor Activity Recognition

Add code
May 27, 2025
Viaarxiv icon

Simulating the Unseen: Crash Prediction Must Learn from What Did Not Happen

Add code
May 27, 2025
Viaarxiv icon

Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition

Add code
May 26, 2025
Viaarxiv icon

ART-DECO: Arbitrary Text Guidance for 3D Detailizer Construction

Add code
May 26, 2025
Viaarxiv icon

SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models

Add code
May 24, 2025
Viaarxiv icon

Large-Scale Bayesian Tensor Reconstruction: An Approximate Message Passing Solution

Add code
May 22, 2025
Viaarxiv icon