Picture for Xingyao Wang

Xingyao Wang

The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents

Add code
Nov 05, 2025
Viaarxiv icon

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Add code
Oct 29, 2025
Viaarxiv icon

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Add code
Jul 08, 2025
Viaarxiv icon

LocAgent: Graph-Guided LLM Agents for Code Localization

Add code
Mar 12, 2025
Figure 1 for LocAgent: Graph-Guided LLM Agents for Code Localization
Figure 2 for LocAgent: Graph-Guided LLM Agents for Code Localization
Figure 3 for LocAgent: Graph-Guided LLM Agents for Code Localization
Figure 4 for LocAgent: Graph-Guided LLM Agents for Code Localization
Viaarxiv icon

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Add code
Feb 12, 2025
Viaarxiv icon

SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering

Add code
Feb 10, 2025
Figure 1 for SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering
Figure 2 for SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering
Figure 3 for SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering
Figure 4 for SyncMind: Measuring Agent Out-of-Sync Recovery in Collaborative Software Engineering
Viaarxiv icon

Training Software Engineering Agents and Verifiers with SWE-Gym

Add code
Dec 30, 2024
Figure 1 for Training Software Engineering Agents and Verifiers with SWE-Gym
Figure 2 for Training Software Engineering Agents and Verifiers with SWE-Gym
Figure 3 for Training Software Engineering Agents and Verifiers with SWE-Gym
Figure 4 for Training Software Engineering Agents and Verifiers with SWE-Gym
Viaarxiv icon

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Add code
Jul 23, 2024
Figure 1 for OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Figure 2 for OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Figure 3 for OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Figure 4 for OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Viaarxiv icon

A Single Transformer for Scalable Vision-Language Modeling

Add code
Jul 08, 2024
Viaarxiv icon

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

Add code
May 31, 2024
Figure 1 for SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Figure 2 for SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Figure 3 for SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Figure 4 for SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales
Viaarxiv icon