Rongrong Ji

Xiamen University, Peng Cheng Laboratory

Motion-Aware Caching for Efficient Autoregressive Video Generation

May 03, 2026
Via arXiv

Prototype-Based Test-Time Adaptation of Vision-Language Models

Apr 23, 2026
Via arXiv

ID-Selection: Importance-Diversity Based Visual Token Selection for Efficient LVLM Inference

Apr 07, 2026
Via arXiv

Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism

Mar 31, 2026
Via arXiv

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Mar 24, 2026
Via arXiv

ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

Mar 24, 2026
Via arXiv

Persistent Story World Simulation with Continuous Character Customization

Mar 17, 2026
Via arXiv

SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models

Mar 17, 2026
Via arXiv

Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation

Mar 10, 2026
Via arXiv

Event-Anchored Frame Selection for Effective Long-Video Understanding

Mar 01, 2026
Via arXiv