Picture for Chengdi Ma

Chengdi Ma

MUA-RL: Multi-turn User-interacting Agent Reinforcement Learning for agentic tool use

Add code
Aug 26, 2025
Viaarxiv icon