Zihao Dongfang

AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents

Mar 19, 2026

Temporal Gains, Spatial Costs: Revisiting Video Fine-Tuning in Multimodal Large Language Models

Mar 18, 2026

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Oct 29, 2025

PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era

Sep 16, 2025

Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning?

May 17, 2025