Ziyan Yang

Growing Visual Generative Capacity for Pre-Trained MLLMs

Oct 02, 2025
Via arXiv

NoiseShift: Resolution-Aware Noise Recalibration for Better Low-Resolution Image Generation

Oct 02, 2025
Via arXiv

Mixture of Contexts for Long Video Generation

Aug 28, 2025
Via arXiv

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Jun 23, 2025
Via arXiv

VINCIE: Unlocking In-context Image Editing from Video

Jun 12, 2025
Via arXiv

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Apr 11, 2025
Via arXiv

Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance

Mar 27, 2025
Via arXiv

Synthetic Video Enhances Physical Fidelity in Video Synthesis

Mar 26, 2025
Via arXiv

Long Context Tuning for Video Generation

Mar 13, 2025
Via arXiv

Is Your Text-to-Image Model Robust to Caption Noise?

Dec 27, 2024
Via arXiv