Alert button

A short variational proof of equivalence between policy gradients and soft Q learning

Dec 22, 2017
Pierre H. Richemond, Brendan Maginnis

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: