Picture for Xinghao Zhao

Xinghao Zhao

Entropy trajectory shape predicts LLM reasoning reliability: A diagnostic study of uncertainty dynamics in chain-of-thought

Add code
Mar 19, 2026
Viaarxiv icon

IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck

Add code
Jan 09, 2026
Viaarxiv icon