Picture for Zhedong Cen

Zhedong Cen

Can Multimodal Large Language Models Understand Spatial Relations?

Add code
May 25, 2025
Viaarxiv icon