Picture for Xinyuan Chen

Xinyuan Chen

GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects

Add code
Jun 18, 2025
Viaarxiv icon

Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs

Add code
Jun 08, 2025
Viaarxiv icon

Training-free Stylized Text-to-Image Generation with Fast Inference

Add code
May 25, 2025
Viaarxiv icon

The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation

Add code
Apr 16, 2025
Viaarxiv icon

AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset

Add code
Mar 25, 2025
Viaarxiv icon

GMG: A Video Prediction Method Based on Global Focus and Motion Guided

Add code
Mar 14, 2025
Viaarxiv icon

MouseGPT: A Large-scale Vision-Language Model for Mouse Behavior Analysis

Add code
Mar 13, 2025
Viaarxiv icon

TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision

Add code
Mar 10, 2025
Viaarxiv icon

An Egocentric Vision-Language Model based Portable Real-time Smart Assistant

Add code
Mar 06, 2025
Viaarxiv icon

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Add code
Jan 14, 2025
Figure 1 for Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Figure 2 for Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Figure 3 for Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Figure 4 for Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Viaarxiv icon