Projected Off-Policy Q-Learning (POP-QL) for Stabilizing Offline Reinforcement Learning

Nov 25, 2023
Melrose Roderick, Gaurav Manek, Felix Berkenkamp, J. Zico Kolter
