Picture for Kendrick Phan

Kendrick Phan

SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions

Add code
Apr 09, 2026
Viaarxiv icon