Picture for Shalini Chaudhuri

Shalini Chaudhuri

Learning to Rank Caption Chains for Video-Text Alignment

Add code
Mar 26, 2026
Viaarxiv icon

Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation

Add code
Mar 19, 2026
Viaarxiv icon