Juntong Feng

VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?

Feb 04, 2026
Via arXiv