Xin Tao

StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors

Dec 18, 2025
Via arXiv

Alchemist: Unlocking Efficiency in Text-to-Image Model Training via Meta-Gradient Data Selection

Dec 18, 2025
Via arXiv

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Dec 16, 2025
Via arXiv

Astra: General Interactive World Model with Autoregressive Denoising

Dec 15, 2025
Via arXiv

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Dec 08, 2025
Via arXiv

Denoising Vision Transformer Autoencoder with Spectral Self-Regularization

Nov 16, 2025
Via arXiv

Terra: Explorable Native 3D World Model with Point Latents

Oct 16, 2025
Via arXiv

Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance

Oct 14, 2025
Via arXiv

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

Sep 03, 2025
Via arXiv

Score Augmentation for Diffusion Models

Aug 11, 2025
Via arXiv