Picture for Xuhui Zhou

Xuhui Zhou

Reinforcing Human Behavior Simulation via Verbal Feedback

Add code
May 19, 2026
Viaarxiv icon

GoodPoint: Learning Constructive Scientific Paper Feedback from Author Responses

Add code
Apr 13, 2026
Viaarxiv icon

CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents

Add code
Mar 18, 2026
Viaarxiv icon

Mind the Sim2Real Gap in User Simulation for Agentic Tasks

Add code
Mar 11, 2026
Viaarxiv icon

The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents

Add code
Nov 05, 2025
Figure 1 for The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents
Figure 2 for The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents
Figure 3 for The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents
Figure 4 for The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents
Viaarxiv icon

1-2-3 Check: Enhancing Contextual Privacy in LLM via Multi-Agent Reasoning

Add code
Aug 11, 2025
Viaarxiv icon

OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety

Add code
Jul 08, 2025
Figure 1 for OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
Figure 2 for OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
Figure 3 for OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
Figure 4 for OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
Viaarxiv icon

Words Like Knives: Backstory-Personalized Modeling and Detection of Violent Communication

Add code
May 27, 2025
Viaarxiv icon

SOTOPIA-S4: a user-friendly system for flexible, customizable, and large-scale social simulation

Add code
Apr 19, 2025
Viaarxiv icon

Rethinking Theory of Mind Benchmarks for LLMs: Towards A User-Centered Perspective

Add code
Apr 15, 2025
Viaarxiv icon