Zhenxiong Tan

SpotEdit: Selective Region Editing in Diffusion Transformers

Dec 26, 2025
Via arXiv

FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation

Nov 18, 2025
Via arXiv

Image Editing As Programs with Diffusion Models

Jun 04, 2025
Via arXiv

Minute-Long Videos with Dual Parallelisms

May 29, 2025
Via arXiv

Ultra-Resolution Adaptation with Ease

Mar 20, 2025
Via arXiv

OminiControl2: Efficient Conditioning for Diffusion Transformers

Mar 11, 2025
Via arXiv

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Dec 20, 2024
Via arXiv

MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation

Dec 05, 2024
Via arXiv

OminiControl: Minimal and Universal Control for Diffusion Transformer

Nov 25, 2024
Via arXiv

LinFusion: 1 GPU, 1 Minute, 16K Image

Sep 03, 2024
Via arXiv