Picture for Haoyun Deng

Haoyun Deng

CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use

Add code
Feb 12, 2026
Viaarxiv icon

Complex Logical Instruction Generation

Add code
Aug 12, 2025
Viaarxiv icon