Picture for Longteng Guo

Longteng Guo

Institute of Automation, Chinese Academy of Sciences, School of Artificial Intelligence, University of Chinese Academy of Sciences

NavWM: A Unified Navigation World Model for Foresight-Driven Planning

Add code
Jun 23, 2026
Viaarxiv icon

SurveilNav: Collaborative Object Goal Navigation with Robot and Surveillance System

Add code
Jun 23, 2026
Viaarxiv icon

VeriSpace: Spatially Grounded Action Verification for Vision-Language-Action Models

Add code
Jun 09, 2026
Viaarxiv icon

LongSpace: Exploring Long-Horizon Spatial Memory from Perception to Recall in Video

Add code
Jun 04, 2026
Viaarxiv icon

Can MLLMs Reason Beyond Language? VisReason: A Comprehensive Benchmark for Vision-Centric Reasoning

Add code
May 25, 2026
Viaarxiv icon

Semantic-Enriched Latent Visual Reasoning

Add code
May 19, 2026
Viaarxiv icon

When Robots Do the Chores: A Benchmark and Agent for Long-Horizon Household Task Execution

Add code
May 14, 2026
Viaarxiv icon

SciVQR: A Multidisciplinary Multimodal Benchmark for Advanced Scientific Reasoning Evaluation

Add code
May 11, 2026
Viaarxiv icon

M$^3$-VQA: A Benchmark for Multimodal, Multi-Entity, Multi-Hop Visual Question Answering

Add code
Apr 28, 2026
Viaarxiv icon

AdaSpark: Adaptive Sparsity for Efficient Long-Video Understanding

Add code
Apr 09, 2026
Viaarxiv icon