Picture for Shalini Chaudhuri

Shalini Chaudhuri

Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation

Add code
Mar 19, 2026
Viaarxiv icon