Picture for Zhenxiong Tan

Zhenxiong Tan

ViFeEdit: A Video-Free Tuner of Your Video Diffusion Transformer

Add code
Mar 16, 2026
Viaarxiv icon

SpotEdit: Selective Region Editing in Diffusion Transformers

Add code
Dec 26, 2025
Viaarxiv icon

FreeSwim: Revisiting Sliding-Window Attention Mechanisms for Training-Free Ultra-High-Resolution Video Generation

Add code
Nov 18, 2025
Viaarxiv icon

Image Editing As Programs with Diffusion Models

Add code
Jun 04, 2025
Viaarxiv icon

Minute-Long Videos with Dual Parallelisms

Add code
May 29, 2025
Figure 1 for Minute-Long Videos with Dual Parallelisms
Figure 2 for Minute-Long Videos with Dual Parallelisms
Figure 3 for Minute-Long Videos with Dual Parallelisms
Figure 4 for Minute-Long Videos with Dual Parallelisms
Viaarxiv icon

Ultra-Resolution Adaptation with Ease

Add code
Mar 20, 2025
Viaarxiv icon

OminiControl2: Efficient Conditioning for Diffusion Transformers

Add code
Mar 11, 2025
Figure 1 for OminiControl2: Efficient Conditioning for Diffusion Transformers
Figure 2 for OminiControl2: Efficient Conditioning for Diffusion Transformers
Figure 3 for OminiControl2: Efficient Conditioning for Diffusion Transformers
Figure 4 for OminiControl2: Efficient Conditioning for Diffusion Transformers
Viaarxiv icon

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Add code
Dec 20, 2024
Figure 1 for CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up
Figure 2 for CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up
Figure 3 for CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up
Figure 4 for CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up
Viaarxiv icon

MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation

Add code
Dec 05, 2024
Figure 1 for MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Figure 2 for MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Figure 3 for MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Figure 4 for MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Viaarxiv icon

OminiControl: Minimal and Universal Control for Diffusion Transformer

Add code
Nov 25, 2024
Figure 1 for OminiControl: Minimal and Universal Control for Diffusion Transformer
Figure 2 for OminiControl: Minimal and Universal Control for Diffusion Transformer
Figure 3 for OminiControl: Minimal and Universal Control for Diffusion Transformer
Figure 4 for OminiControl: Minimal and Universal Control for Diffusion Transformer
Viaarxiv icon