Dongzhou Cheng

One Refiner to Unlock Them All: Inference-Time Reasoning Elicitation via Reinforcement Query Refinement

Apr 28, 2026
Via arXiv

Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL

Feb 13, 2026
Via arXiv