Picture for Zhou Tao

Zhou Tao

Dynamic Token Compression for Efficient Video Understanding through Reinforcement Learning

Add code
Mar 27, 2026
Viaarxiv icon

DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Model

Add code
Dec 14, 2025
Viaarxiv icon