Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet?



Nenad Tomasev, Ioana Bica, Brian McWilliams, Lars Buesing, Razvan Pascanu, Charles Blundell, Jovana Mitrovic



Counterfactual Credit Assignment in Model-Free Reinforcement Learning



Thomas Mesnard, Théophane Weber, Fabio Viola, Shantanu Thakoor, Alaa Saade, Anna Harutyunyan, Will Dabney, Tom Stepleton, Nicolas Heess, Arthur Guez, Marcus Hutter, Lars Buesing, Rémi Munos



On the role of planning in model-based deep reinforcement learning



Jessica B. Hamrick, Abram L. Friesen, Feryal Behbahani, Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Buesing, Petar Veličković, Théophane Weber



Representation Learning via Invariant Causal Mechanisms



Jovana Mitrovic, Brian McWilliams, Jacob Walker, Lars Buesing, Charles Blundell



Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban



Peter Karkus, Mehdi Mirza, Arthur Guez, Andrew Jaegle, Timothy Lillicrap, Lars Buesing, Nicolas Heess, Theophane Weber



Physically Embedded Planning Problems: New Challenges for Reinforcement Learning



Mehdi Mirza, Andrew Jaegle, Jonathan J. Hunt, Arthur Guez, Saran Tunyasuvunakool, Alistair Muldal, Théophane Weber, Peter Karkus, Sébastien Racanière, Lars Buesing, Timothy Lillicrap, Nicolas Heess



Pointer Graph Networks



Petar Veličković, Lars Buesing, Matthew C. Overlan, Razvan Pascanu, Oriol Vinyals, Charles Blundell



Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning



Giambattista Parascandolo, Lars Buesing, Josh Merel, Leonard Hasenclever, John Aslanides, Jessica B. Hamrick, Nicolas Heess, Alexander Neitz, Theophane Weber



Value-driven Hindsight Modelling



Arthur Guez, Fabio Viola, Théophane Weber, Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess

* 8 pages + reference + appendix 


Causally Correct Partial Models for Reinforcement Learning



Danilo J. Rezende, Ivo Danihelka, George Papamakarios, Nan Rosemary Ke, Ray Jiang, Theophane Weber, Karol Gregor, Hamza Merzic, Fabio Viola, Jane Wang, Jovana Mitrovic, Frederic Besse, Ioannis Antonoglou, Lars Buesing


