Picture for Juneyoung Ro

Juneyoung Ro

How Well Do Vision--Language Models Understand Cities? A Comparative Study on Spatial Reasoning from Street-View Images

Add code
Aug 29, 2025
Viaarxiv icon