From $r$ to $Q^*$: Your Language Model is Secretly a Q-Function

Apr 18, 2024
Rafael Rafailov, Joey Hejna, Ryan Park, Chelsea Finn
