Picture for Zhao Zhong

Zhao Zhong

Symbiotic-MoE: Unlocking the Synergy between Generation and Understanding

Add code
Apr 09, 2026
Viaarxiv icon

Unify-Agent: A Unified Multimodal Agent for World-Grounded Image Synthesis

Add code
Apr 01, 2026
Viaarxiv icon

OmniWeaving: Towards Unified Video Generation with Free-form Composition and Reasoning

Add code
Mar 25, 2026
Viaarxiv icon

Manifold-Aware Exploration for Reinforcement Learning in Video Generation

Add code
Mar 23, 2026
Viaarxiv icon

HYDRA: Unifying Multi-modal Generation and Understanding via Representation-Harmonized Tokenization

Add code
Mar 17, 2026
Viaarxiv icon

UniCom: Unified Multimodal Modeling via Compressed Continuous Semantic Representations

Add code
Mar 11, 2026
Viaarxiv icon

DisCa: Accelerating Video Diffusion Transformers with Distillation-Compatible Learnable Feature Caching

Add code
Feb 05, 2026
Viaarxiv icon

iFSQ: Improving FSQ for Image Generation with 1 Line of Code

Add code
Jan 27, 2026
Viaarxiv icon

HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation

Add code
Aug 23, 2025
Viaarxiv icon

MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE

Add code
Jul 29, 2025
Viaarxiv icon