Picture for Zhenghui Jin

Zhenghui Jin

ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Add code
Mar 19, 2026
Viaarxiv icon

Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization

Add code
Nov 18, 2025
Figure 1 for Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization
Figure 2 for Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization
Figure 3 for Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization
Figure 4 for Empowering Multi-Turn Tool-Integrated Reasoning with Group Turn Policy Optimization
Viaarxiv icon