Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Return-based Scaling: Yet Another Normalisation Trick for Deep RL



Tom Schaul , Georg Ostrovski , Iurii Kemaev , Diana Borsa


   Access Paper or Ask Questions

Podracer architectures for scalable Reinforcement Learning



Matteo Hessel , Manuel Kroiss , Aidan Clark , Iurii Kemaev , John Quan , Thomas Keck , Fabio Viola , Hado van Hasselt


   Access Paper or Ask Questions

Discovery of Options via Meta-Learned Subgoals



Vivek Veeriah , Tom Zahavy , Matteo Hessel , Zhongwen Xu , Junhyuk Oh , Iurii Kemaev , Hado van Hasselt , David Silver , Satinder Singh


   Access Paper or Ask Questions

Discovering a set of policies for the worst case reward



Tom Zahavy , Andre Barreto , Daniel J Mankowitz , Shaobo Hou , Brendan O'Donoghue , Iurii Kemaev , Satinder Baveja Singh


   Access Paper or Ask Questions

ReSet: Learning Recurrent Dynamic Routing in ResNet-like Neural Networks



Iurii Kemaev , Daniil Polykovskiy , Dmitry Vetrov

* Proceedings of The 10th Asian Conference on Machine Learning, PMLR 95:422-437, 2018 
* Published in Proceedings of The 10th Asian Conference on Machine Learning, http://proceedings.mlr.press/v95/kemaev18a.html 

   Access Paper or Ask Questions