Picture for Wenshuo Zhao

Wenshuo Zhao

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Add code
Oct 29, 2025
Viaarxiv icon

Model-Task Alignment Drives Distinct RL Outcomes

Add code
Aug 28, 2025
Viaarxiv icon

Steering LLM Thinking with Budget Guidance

Add code
Jun 16, 2025
Viaarxiv icon