Shoufa Chen

Tuna-2: Pixel Embeddings Beat Vision Encoders for Multimodal Understanding and Generation

Apr 27, 2026

Rein3D: Reinforced 3D Indoor Scene Generation with Panoramic Video Diffusion Models

Apr 14, 2026

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

Dec 24, 2025

PixelFlow: Pixel-Space Generative Models with Flow

Apr 10, 2025

Goku: Flow Based Video Generative Foundation Models

Feb 10, 2025

FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Feb 07, 2025

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Dec 19, 2024

ControlAR: Controllable Image Generation with Autoregressive Models

Oct 03, 2024

MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents

Jun 12, 2024

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Jun 10, 2024