Picture for Hongyu Xian

Hongyu Xian

Policy of Thoughts: Scaling LLM Reasoning via Test-time Policy Evolution

Add code
Jan 28, 2026
Viaarxiv icon