Picture for Arjun Prasaath Anbazhagan

Arjun Prasaath Anbazhagan

HiPO: Hierarchical Preference Optimization for Adaptive Reasoning in LLMs

Add code
Apr 22, 2026
Viaarxiv icon