Picture for Kaixing Zhang

Kaixing Zhang

Interpreting and Controlling LLM Reasoning through Integrated Policy Gradient

Add code
Feb 03, 2026
Viaarxiv icon