Picture for Cheng Yu

Cheng Yu

Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models

Add code
Jun 25, 2026
Viaarxiv icon

FreeSonic: Training-Free Temporal-Aware Decoupled Attention for Precise Audio Editing

Add code
Jun 13, 2026
Viaarxiv icon

JODA: Composable Joint Dynamics for Articulated Objects

Add code
May 11, 2026
Viaarxiv icon

DDA-Thinker: Decoupled Dual-Atomic Reinforcement Learning for Reasoning-Driven Image Editing

Add code
Apr 28, 2026
Viaarxiv icon

AdaSpark: Adaptive Sparsity for Efficient Long-Video Understanding

Add code
Apr 09, 2026
Viaarxiv icon

SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation

Add code
Mar 23, 2026
Viaarxiv icon

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Add code
Jan 21, 2026
Viaarxiv icon

Unified Thinker: A General Reasoning Modular Core for Image Generation

Add code
Jan 06, 2026
Viaarxiv icon

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Add code
Dec 31, 2025
Viaarxiv icon

Understanding Diffusion Models via Code Execution

Add code
Dec 08, 2025
Viaarxiv icon