Picture for Zhiyi Zhu

Zhiyi Zhu

MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding

Add code
Jun 10, 2025
Viaarxiv icon

Enhancing Video Memorability Prediction with Text-Motion Cross-modal Contrastive Loss and Its Application in Video Summarization

Add code
Jun 10, 2025
Viaarxiv icon