Picture for Canyu Zhao

Canyu Zhao

MARBLE: Multi-Aspect Reward Balance for Diffusion RL

Add code
May 07, 2026
Viaarxiv icon

MMControl: Unified Multi-Modal Control for Joint Audio-Video Generation

Add code
Apr 22, 2026
Viaarxiv icon

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

Add code
Feb 06, 2026
Viaarxiv icon

StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation

Add code
Oct 06, 2025
Viaarxiv icon

Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization

Add code
Aug 20, 2025
Figure 1 for Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization
Figure 2 for Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization
Figure 3 for Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization
Figure 4 for Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization
Viaarxiv icon

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Add code
May 27, 2025
Viaarxiv icon

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Add code
May 26, 2025
Viaarxiv icon

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Add code
Feb 25, 2025
Viaarxiv icon

MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence

Add code
Jul 23, 2024
Figure 1 for MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
Figure 2 for MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
Figure 3 for MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
Figure 4 for MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence
Viaarxiv icon

FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition

Add code
May 22, 2024
Figure 1 for FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Figure 2 for FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Figure 3 for FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Figure 4 for FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Viaarxiv icon