Picture for Yu Su

Yu Su

Automatic Image-Level Morphological Trait Annotation for Organismal Images

Add code
Apr 02, 2026
Viaarxiv icon

CUBE: A Standard for Unifying Agent Benchmarks

Add code
Mar 16, 2026
Viaarxiv icon

REMem: Reasoning with Episodic Memory in Language Agent

Add code
Feb 13, 2026
Viaarxiv icon

Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation

Add code
Feb 10, 2026
Viaarxiv icon

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

Add code
Feb 09, 2026
Viaarxiv icon

Intelligent Multimodal Multi-Sensor Fusion-Based UAV Identification, Localization, and Countermeasures for Safeguarding Low-Altitude Economy

Add code
Oct 27, 2025
Viaarxiv icon

Watch and Learn: Learning to Use Computers from Online Videos

Add code
Oct 06, 2025
Viaarxiv icon

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Add code
Jun 26, 2025
Figure 1 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Figure 2 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Figure 3 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Figure 4 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Viaarxiv icon

OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation

Add code
Jun 05, 2025
Figure 1 for OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation
Figure 2 for OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation
Figure 3 for OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation
Figure 4 for OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation
Viaarxiv icon

BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning

Add code
May 29, 2025
Figure 1 for BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning
Figure 2 for BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning
Figure 3 for BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning
Figure 4 for BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning
Viaarxiv icon