Li Yuan

Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation

Jul 15, 2024
Via arXiv

LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference

Jun 26, 2024
Via arXiv

ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

Jun 26, 2024
Via arXiv

Point Tree Transformer for Point Cloud Registration

Jun 25, 2024
Via arXiv

DF40: Toward Next-Generation Deepfake Detection

Jun 19, 2024
Via arXiv

ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Jun 06, 2024
Via arXiv

RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter

May 29, 2024
Via arXiv

EvaGaussians: Event Stream Assisted Gaussian Splatting from Blurry Images

May 29, 2024
Via arXiv

GraCo: Granularity-Controllable Interactive Segmentation

May 01, 2024
Via arXiv

UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark

Apr 15, 2024
Via arXiv