Picture for Weikang Zhao

Weikang Zhao

MUA-RL: Multi-turn User-interacting Agent Reinforcement Learning for agentic tool use

Add code
Aug 26, 2025
Viaarxiv icon