Picture for Liang-Chieh Chen

Liang-Chieh Chen

Taming Outlier Tokens in Diffusion Transformers

Add code
May 06, 2026
Viaarxiv icon

Large Language Models are Universal Reasoners for Visual Generation

Add code
May 05, 2026
Viaarxiv icon

A Frame is Worth One Token: Efficient Generative World Modeling with Delta Tokens

Add code
Apr 06, 2026
Viaarxiv icon

Autoregressive Image Generation with Masked Bit Modeling

Add code
Feb 09, 2026
Viaarxiv icon

Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers

Add code
May 20, 2025
Figure 1 for Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers
Figure 2 for Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers
Figure 3 for Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers
Figure 4 for Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers
Viaarxiv icon

ReVision: High-Quality, Low-Cost Video Generation with Explicit 3D Physics Modeling for Complex Motion and Interaction

Add code
Apr 30, 2025
Viaarxiv icon

Deeply Supervised Flow-Based Generative Models

Add code
Mar 18, 2025
Viaarxiv icon

FlowTok: Flowing Seamlessly Across Text and Image Tokens

Add code
Mar 13, 2025
Viaarxiv icon

Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation

Add code
Feb 27, 2025
Viaarxiv icon

COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation

Add code
Feb 04, 2025
Viaarxiv icon