Songwei Ge

Flow Matching Policy Gradients

Jul 28, 2025, via arXiv

A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation

Jun 09, 2025, via arXiv

Cosmos World Foundation Model Platform for Physical AI

Jan 07, 2025, via arXiv

Illusion3D: 3D Multiview Illusion with 2D Diffusion Priors

Dec 12, 2024, via arXiv

Rethinking Score Distillation as a Bridge Between Image Distributions

Jun 13, 2024, via arXiv

Coherent Zero-Shot Visual Instruction Generation

Jun 06, 2024, via arXiv

On the Content Bias in Fréchet Video Distance

Apr 18, 2024, via arXiv

Grounded Text-to-Image Synthesis with Attention Refocusing

Jun 08, 2023, via arXiv

Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models

May 17, 2023, via arXiv

Expressive Text-to-Image Generation with Rich Text

Apr 13, 2023, via arXiv