Picture for Zhewei Kang

Zhewei Kang

Learning to Reason without External Rewards

Add code
May 26, 2025
Viaarxiv icon

Scalable Best-of-N Selection for Large Language Models via Self-Certainty

Add code
Feb 25, 2025
Viaarxiv icon