Quanling Liu

Same Evidence, Different Answers: Canonical-Context On-Policy Distillation for Multi-Turn Language Models

May 28, 2026
Via arXiv