Rongxin Gao

Thinking in Dynamics: How Multimodal Large Language Models Perceive, Track, and Reason Dynamics in Physical 4D World

Mar 13, 2026
Via arXiv