Picture for Qi Tian

Qi Tian

Refer to the report for detailed contributions

Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

Add code
Sep 04, 2025
Viaarxiv icon

Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds

Add code
Aug 20, 2025
Viaarxiv icon

METEOR: Multi-Encoder Collaborative Token Pruning for Efficient Vision Language Models

Add code
Jul 28, 2025
Viaarxiv icon

MagCache: Fast Video Generation with Magnitude-Aware Cache

Add code
Jun 10, 2025
Viaarxiv icon

Mixpert: Mitigating Multimodal Learning Conflicts with Efficient Mixture-of-Vision-Experts

Add code
May 30, 2025
Viaarxiv icon

Tackling View-Dependent Semantics in 3D Language Gaussian Splatting

Add code
May 30, 2025
Viaarxiv icon

Multimodal LLM-Guided Semantic Correction in Text-to-Image Diffusion

Add code
May 26, 2025
Viaarxiv icon

Efficient Multi-modal Long Context Learning for Training-free Adaptation

Add code
May 26, 2025
Viaarxiv icon

Dereflection Any Image with Diffusion Priors and Diversified Data

Add code
Mar 21, 2025
Viaarxiv icon

RASA: Replace Anyone, Say Anything -- A Training-Free Framework for Audio-Driven and Universal Portrait Video Editing

Add code
Mar 14, 2025
Viaarxiv icon