Kaiyuan Liu

Backtracking When It Strays: Mitigating Dual Exposure Biases in LLM Reasoning Distillation

May 19, 2026
Via arXiv