Shaobo Ju

Towards Effective and Efficient Long Video Understanding of Multimodal Large Language Models via One-shot Clip Retrieval

Dec 09, 2025
Via arXiv