Yu Su

Why Far Looks Up: Probing Spatial Representation in Vision-Language Models

May 28, 2026
Via arXiv

QUEST: Training Frontier Deep Research Agents with Fully Synthetic Tasks

May 22, 2026
Via arXiv

Leveraging Latent Visual Reasoning in Silence

May 18, 2026
Via arXiv

Automatic Image-Level Morphological Trait Annotation for Organismal Images

Apr 02, 2026
Via arXiv

CUBE: A Standard for Unifying Agent Benchmarks

Mar 16, 2026
Via arXiv

REMem: Reasoning with Episodic Memory in Language Agent

Feb 13, 2026
Via arXiv

Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation

Feb 10, 2026
Via arXiv

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

Feb 09, 2026
Via arXiv

Intelligent Multimodal Multi-Sensor Fusion-Based UAV Identification, Localization, and Countermeasures for Safeguarding Low-Altitude Economy

Oct 27, 2025
Via arXiv

Watch and Learn: Learning to Use Computers from Online Videos

Oct 06, 2025
Via arXiv