Picture for Yue Ma

Yue Ma

The Hong Kong University of Science and Technology

SkipVAR: Accelerating Visual Autoregressive Modeling via Adaptive Frequency-Aware Skipping

Add code
Jun 11, 2025
Viaarxiv icon

Follow-Your-Motion: Video Motion Transfer via Efficient Spatial-Temporal Decoupled Finetuning

Add code
Jun 05, 2025
Viaarxiv icon

Follow-Your-Creation: Empowering 4D Creation through Video Inpainting

Add code
Jun 05, 2025
Viaarxiv icon

AvatarArtist: Open-Domain 4D Avatarization

Add code
Mar 26, 2025
Viaarxiv icon

EEdit : Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing

Add code
Mar 13, 2025
Viaarxiv icon

RadioLLM: Introducing Large Language Model into Cognitive Radio via Hybrid Prompt and Token Reprogrammings

Add code
Jan 28, 2025
Viaarxiv icon

ListConRanker: A Contrastive Text Reranker with Listwise Encoding

Add code
Jan 13, 2025
Figure 1 for ListConRanker: A Contrastive Text Reranker with Listwise Encoding
Figure 2 for ListConRanker: A Contrastive Text Reranker with Listwise Encoding
Figure 3 for ListConRanker: A Contrastive Text Reranker with Listwise Encoding
Figure 4 for ListConRanker: A Contrastive Text Reranker with Listwise Encoding
Viaarxiv icon

Enhancing Image Generation Fidelity via Progressive Prompts

Add code
Jan 13, 2025
Figure 1 for Enhancing Image Generation Fidelity via Progressive Prompts
Figure 2 for Enhancing Image Generation Fidelity via Progressive Prompts
Figure 3 for Enhancing Image Generation Fidelity via Progressive Prompts
Figure 4 for Enhancing Image Generation Fidelity via Progressive Prompts
Viaarxiv icon

H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving

Add code
Jan 08, 2025
Figure 1 for H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
Figure 2 for H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
Figure 3 for H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
Figure 4 for H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving
Viaarxiv icon

Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance

Add code
Dec 21, 2024
Figure 1 for Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance
Figure 2 for Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance
Figure 3 for Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance
Figure 4 for Follow-Your-MultiPose: Tuning-Free Multi-Character Text-to-Video Generation via Pose Guidance
Viaarxiv icon