Miles Yang

TMD-Bench: A Multi-Level Evaluation Paradigm for Music-Dance Co-Generation

May 03, 2026
Via arXiv

Symbiotic-MoE: Unlocking the Synergy between Generation and Understanding

Apr 09, 2026
Via arXiv

HYDRA: Unifying Multi-modal Generation and Understanding via Representation-Harmonized Tokenization

Mar 17, 2026
Via arXiv

UniCom: Unified Multimodal Modeling via Compressed Continuous Semantic Representations

Mar 11, 2026
Via arXiv

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

Jan 27, 2026
Via arXiv

HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation

Aug 23, 2025
Via arXiv

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Jul 29, 2025
Via arXiv