Picture for Xin Tao

Xin Tao

Training-Free Efficient Video Generation via Dynamic Token Carving

Add code
May 22, 2025
Viaarxiv icon

VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption

Add code
May 17, 2025
Viaarxiv icon

BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation

Add code
Apr 23, 2025
Viaarxiv icon

Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings

Add code
Mar 24, 2025
Viaarxiv icon

DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers

Add code
Mar 18, 2025
Viaarxiv icon

MTV-Inpaint: Multi-Task Long Video Inpainting

Add code
Mar 14, 2025
Viaarxiv icon

RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification

Add code
Mar 04, 2025
Viaarxiv icon

Owl-1: Omni World Model for Consistent Long Video Generation

Add code
Dec 12, 2024
Figure 1 for Owl-1: Omni World Model for Consistent Long Video Generation
Figure 2 for Owl-1: Omni World Model for Consistent Long Video Generation
Figure 3 for Owl-1: Omni World Model for Consistent Long Video Generation
Figure 4 for Owl-1: Omni World Model for Consistent Long Video Generation
Viaarxiv icon

Towards Precise Scaling Laws for Video Diffusion Transformers

Add code
Nov 25, 2024
Figure 1 for Towards Precise Scaling Laws for Video Diffusion Transformers
Figure 2 for Towards Precise Scaling Laws for Video Diffusion Transformers
Figure 3 for Towards Precise Scaling Laws for Video Diffusion Transformers
Figure 4 for Towards Precise Scaling Laws for Video Diffusion Transformers
Viaarxiv icon

Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content

Add code
Oct 10, 2024
Viaarxiv icon