Shuai Tan

PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models

Jan 16, 2026
Via arXiv

CoDance: An Unbind-Rebind Paradigm for Robust Multi-Subject Animation

Jan 16, 2026
Via arXiv

ESGaussianFace: Emotional and Stylized Audio-Driven Facial Animation via 3D Gaussian Splatting

Jan 05, 2026
Via arXiv

Phys-Liquid: A Physics-Informed Dataset for Estimating 3D Geometry and Volume of Transparent Deformable Liquids

Nov 14, 2025
Via arXiv

FixTalk: Taming Identity Leakage for High-Quality Talking Head Generation in Extreme Cases

Jul 02, 2025
Via arXiv

MambaControl: Anatomy Graph-Enhanced Mamba ControlNet with Fourier Refinement for Diffusion-Based Disease Trajectory Prediction

May 15, 2025
Via arXiv

DreamRelation: Relation-Centric Video Customization

Mar 10, 2025
Via arXiv

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

Dec 08, 2024
Via arXiv

Mimir: Improving Video Diffusion Models for Precise Text Understanding

Dec 04, 2024
Via arXiv

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Oct 14, 2024
Via arXiv