Sambit Ghosh

Do Vision Language Models Need to Process Image Tokens?

Apr 10, 2026
Via arXiv