Yabiao Wang

SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment

Aug 08, 2025

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning

Jul 02, 2025

UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions

Jun 16, 2025

Swin DiT: Diffusion Transformer using Pseudo Shifted Windows

May 19, 2025

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

Mar 12, 2025

PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation

Mar 09, 2025

Improving Autoregressive Visual Generation with Cluster-Oriented Token Prediction

Jan 01, 2025

EMOv2: Pushing 5M Vision Model Frontier

Dec 09, 2024

Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration

Dec 05, 2024

DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation

Dec 04, 2024