Picture for Xiaoxin Lin

Xiaoxin Lin

RADAR: Benchmarking Vision-Language-Action Generalization via Real-World Dynamics, Spatial-Physical Intelligence, and Autonomous Evaluation

Add code
Feb 11, 2026
Viaarxiv icon

SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks

Add code
Jun 17, 2025
Figure 1 for SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks
Figure 2 for SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks
Figure 3 for SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks
Figure 4 for SIRI-Bench: Challenging VLMs' Spatial Intelligence through Complex Reasoning Tasks
Viaarxiv icon