Picture for Hang Zhou

Hang Zhou

and Other Contributors

InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Add code
Mar 24, 2026
Viaarxiv icon

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Add code
Mar 19, 2026
Viaarxiv icon

MVHOI: Bridge Multi-view Condition to Complex Human-Object Interaction Video Reenactment via 3D Foundation Model

Add code
Mar 16, 2026
Viaarxiv icon

DISPLAY: Directable Human-Object Interaction Video Generation via Sparse Motion Guidance and Multi-Task Auxiliary

Add code
Mar 10, 2026
Viaarxiv icon

High-Slip-Ratio Control for Peak Tire-Road Friction Estimation Using Automated Vehicles

Add code
Mar 10, 2026
Viaarxiv icon

IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning

Add code
Mar 04, 2026
Viaarxiv icon

CoLoGen: Progressive Learning of Concept-Localization Duality for Unified Image Generation

Add code
Feb 26, 2026
Viaarxiv icon

Transolver-3: Scaling Up Transformer Solvers to Industrial-Scale Geometries

Add code
Feb 04, 2026
Viaarxiv icon

An Empirical Study of World Model Quantization

Add code
Feb 02, 2026
Viaarxiv icon

Physics-informed Diffusion Mamba Transformer for Real-world Driving

Add code
Jan 31, 2026
Viaarxiv icon