V-STaR: Training Verifiers for Self-Taught Reasoners

Feb 09, 2024
Arian Hosseini, Xingdi Yuan, Nikolay Malkin, Aaron Courville, Alessandro Sordoni, Rishabh Agarwal
