Alert button

Efficient Sample Reuse in Policy Gradients with Parameter-based Exploration

Jan 17, 2013
Tingting Zhao, Hirotaka Hachiya, Voot Tangkaratt, Jun Morimoto, Masashi Sugiyama

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: