Picture for Huan Sun

Huan Sun

EvoSchema: Towards Text-to-SQL Robustness Against Schema Evolution

Add code
Mar 11, 2026
Viaarxiv icon

REMem: Reasoning with Episodic Memory in Language Agent

Add code
Feb 13, 2026
Viaarxiv icon

Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation

Add code
Feb 10, 2026
Viaarxiv icon

When Actions Go Off-Task: Detecting and Correcting Misaligned Actions in Computer-Use Agents

Add code
Feb 09, 2026
Viaarxiv icon

When Benign Inputs Lead to Severe Harms: Eliciting Unsafe Unintended Behaviors of Computer-Use Agents

Add code
Feb 09, 2026
Viaarxiv icon

LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

Add code
Feb 06, 2026
Viaarxiv icon

Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation

Add code
Feb 03, 2026
Viaarxiv icon

Large Language Models Achieve Gold Medal Performance at International Astronomy & Astrophysics Olympiad

Add code
Oct 06, 2025
Viaarxiv icon

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Add code
Jun 26, 2025
Figure 1 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Figure 2 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Figure 3 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Figure 4 for Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Viaarxiv icon

AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists

Add code
Jun 09, 2025
Figure 1 for AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists
Figure 2 for AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists
Figure 3 for AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists
Figure 4 for AutoSDT: Scaling Data-Driven Discovery Tasks Toward Open Co-Scientists
Viaarxiv icon