Weilin Huang

Seedream 3.0 Technical Report

Apr 16, 2025

SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL

Apr 15, 2025

DDT: Decoupled Diffusion Transformer

Apr 09, 2025

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Mar 10, 2025

OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization

Dec 19, 2024

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Dec 19, 2024

Fast Prompt Alignment for Text-to-Image Generation

Dec 11, 2024

SeedEdit: Align Image Re-Generation to Image Editing

Nov 11, 2024

UniFL: Improve Stable Diffusion via Unified Feedback Learning

Apr 08, 2024

Turbo: Informativity-Driven Acceleration Plug-In for Vision-Language Models

Dec 12, 2023