Picture for Xinyuan Chen

Xinyuan Chen

LIA-X: Interpretable Latent Portrait Animator

Add code
Aug 13, 2025
Viaarxiv icon

Consistent and Controllable Image Animation with Motion Linear Diffusion Transformers

Add code
Aug 10, 2025
Viaarxiv icon

Self-Improvement for Audio Large Language Model using Unlabeled Speech

Add code
Jul 27, 2025
Viaarxiv icon

GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects

Add code
Jun 18, 2025
Viaarxiv icon

Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs

Add code
Jun 08, 2025
Viaarxiv icon

Training-free Stylized Text-to-Image Generation with Fast Inference

Add code
May 25, 2025
Viaarxiv icon

The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Add code
Apr 16, 2025
Viaarxiv icon

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Add code
Mar 25, 2025
Viaarxiv icon

GMG: A Video Prediction Method Based on Global Focus and Motion Guided

Add code
Mar 14, 2025
Figure 1 for GMG: A Video Prediction Method Based on Global Focus and Motion Guided
Figure 2 for GMG: A Video Prediction Method Based on Global Focus and Motion Guided
Figure 3 for GMG: A Video Prediction Method Based on Global Focus and Motion Guided
Figure 4 for GMG: A Video Prediction Method Based on Global Focus and Motion Guided
Viaarxiv icon

MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis

Add code
Mar 13, 2025
Viaarxiv icon