Picture for Zhiyi Zhu

Zhiyi Zhu

Enhancing Video Memorability Prediction with Text-Motion Cross-modal Contrastive Loss and Its Application in Video Summarization

Add code
Jun 10, 2025
Viaarxiv icon

MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding

Add code
Jun 10, 2025
Viaarxiv icon