Picture for Francisco Galuppo

Francisco Galuppo

When in Doubt, Plan It Out: Committed Small Language Model Deliberation for Reactive Reinforcement Learning

Add code
Jun 15, 2026
Viaarxiv icon