Picture for Suneet Katrekar

Suneet Katrekar

Synthesize and Reward -- Reinforcement Learning for Multi-Step Tool Use in Live Environments

Add code
Jun 03, 2026
Viaarxiv icon