Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

From Motor Control to Team Play in Simulated Humanoid Football



Siqi Liu , Guy Lever , Zhe Wang , Josh Merel , S. M. Ali Eslami , Daniel Hennes , Wojciech M. Czarnecki , Yuval Tassa , Shayegan Omidshafiei , Abbas Abdolmaleki , Noah Y. Siegel , Leonard Hasenclever , Luke Marris , Saran Tunyasuvunakool , H. Francis Song , Markus Wulfmeier , Paul Muller , Tuomas Haarnoja , Brendan D. Tracey , Karl Tuyls , Thore Graepel , Nicolas Heess


   Access Paper or Ask Questions

Biases for Emergent Communication in Multi-agent Reinforcement Learning



Tom Eccles , Yoram Bachrach , Guy Lever , Angeliki Lazaridou , Thore Graepel

* Accepted at NeurIPS 2019 

   Access Paper or Ask Questions

A Generalized Training Approach for Multiagent Learning



Paul Muller , Shayegan Omidshafiei , Mark Rowland , Karl Tuyls , Julien Perolat , Siqi Liu , Daniel Hennes , Luke Marris , Marc Lanctot , Edward Hughes , Zhe Wang , Guy Lever , Nicolas Heess , Thore Graepel , Remi Munos


   Access Paper or Ask Questions

Emergent Coordination Through Competition



Siqi Liu , Guy Lever , Josh Merel , Saran Tunyasuvunakool , Nicolas Heess , Thore Graepel


   Access Paper or Ask Questions

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning



Max Jaderberg , Wojciech M. Czarnecki , Iain Dunning , Luke Marris , Guy Lever , Antonio Garcia Castaneda , Charles Beattie , Neil C. Rabinowitz , Ari S. Morcos , Avraham Ruderman , Nicolas Sonnerat , Tim Green , Louise Deason , Joel Z. Leibo , David Silver , Demis Hassabis , Koray Kavukcuoglu , Thore Graepel


   Access Paper or Ask Questions

Value-Decomposition Networks For Cooperative Multi-Agent Learning



Peter Sunehag , Guy Lever , Audrunas Gruslys , Wojciech Marian Czarnecki , Vinicius Zambaldi , Max Jaderberg , Marc Lanctot , Nicolas Sonnerat , Joel Z. Leibo , Karl Tuyls , Thore Graepel


   Access Paper or Ask Questions

Nesterov's Accelerated Gradient and Momentum as approximations to Regularised Update Descent



Aleksandar Botev , Guy Lever , David Barber


   Access Paper or Ask Questions

A Gauss-Newton Method for Markov Decision Processes



Thomas Furmston , Guy Lever


   Access Paper or Ask Questions

Modeling transition dynamics in MDPs with RKHS embeddings of conditional distributions



Steffen Grünewälder , Luca Baldassarre , Massimiliano Pontil , Arthur Gretton , Guy Lever

* The article can now be found under arXiv:1206.4655. We combined both versions and are withdrawing this version because of the resulting redundancy 

   Access Paper or Ask Questions

Conditional mean embeddings as regressors - supplementary



Steffen Grünewälder , Guy Lever , Luca Baldassarre , Sam Patterson , Arthur Gretton , Massimilano Pontil


   Access Paper or Ask Questions

1
2
>>