Picture for Erfan Bagheri Soula

Erfan Bagheri Soula

Enhancing Temporal Understanding in Video-LLMs through Stacked Temporal Attention in Vision Encoders

Add code
Oct 29, 2025
Viaarxiv icon