Zhenheng Yang

End-to-End Training for Autoregressive Video Diffusion via Self-Resampling

Dec 17, 2025

AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models

Dec 09, 2025

FOCUS: Efficient Keyframe Selection for Long Video Understanding

Oct 31, 2025

Growing Visual Generative Capacity for Pre-Trained MLLMs

Oct 02, 2025

Mixture of Contexts for Long Video Generation

Aug 28, 2025

UniAPO: Unified Multimodal Automated Prompt Optimization

Aug 25, 2025

Show-o2: Improved Native Unified Multimodal Models

Jun 18, 2025

UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning

May 29, 2025

DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling

May 16, 2025

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Apr 11, 2025