Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

From Motor Control to Team Play in Simulated Humanoid Football

Siqi Liu , Guy Lever , Zhe Wang , Josh Merel , S. M. Ali Eslami , Daniel Hennes , Wojciech M. Czarnecki , Yuval Tassa , Shayegan Omidshafiei , Abbas Abdolmaleki , Noah Y. Siegel , Leonard Hasenclever , Luke Marris , Saran Tunyasuvunakool , H. Francis Song , Markus Wulfmeier , Paul Muller , Tuomas Haarnoja , Brendan D. Tracey , Karl Tuyls , Thore Graepel , Nicolas Heess

   Access Paper or Ask Questions

Biases for Emergent Communication in Multi-agent Reinforcement Learning

Tom Eccles , Yoram Bachrach , Guy Lever , Angeliki Lazaridou , Thore Graepel

* Accepted at NeurIPS 2019 

   Access Paper or Ask Questions

A Generalized Training Approach for Multiagent Learning

Paul Muller , Shayegan Omidshafiei , Mark Rowland , Karl Tuyls , Julien Perolat , Siqi Liu , Daniel Hennes , Luke Marris , Marc Lanctot , Edward Hughes , Zhe Wang , Guy Lever , Nicolas Heess , Thore Graepel , Remi Munos

   Access Paper or Ask Questions

Emergent Coordination Through Competition

Siqi Liu , Guy Lever , Josh Merel , Saran Tunyasuvunakool , Nicolas Heess , Thore Graepel

   Access Paper or Ask Questions

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

Max Jaderberg , Wojciech M. Czarnecki , Iain Dunning , Luke Marris , Guy Lever , Antonio Garcia Castaneda , Charles Beattie , Neil C. Rabinowitz , Ari S. Morcos , Avraham Ruderman , Nicolas Sonnerat , Tim Green , Louise Deason , Joel Z. Leibo , David Silver , Demis Hassabis , Koray Kavukcuoglu , Thore Graepel

   Access Paper or Ask Questions

Value-Decomposition Networks For Cooperative Multi-Agent Learning

Peter Sunehag , Guy Lever , Audrunas Gruslys , Wojciech Marian Czarnecki , Vinicius Zambaldi , Max Jaderberg , Marc Lanctot , Nicolas Sonnerat , Joel Z. Leibo , Karl Tuyls , Thore Graepel

   Access Paper or Ask Questions

Nesterov's Accelerated Gradient and Momentum as approximations to Regularised Update Descent

Aleksandar Botev , Guy Lever , David Barber

   Access Paper or Ask Questions

A Gauss-Newton Method for Markov Decision Processes

Thomas Furmston , Guy Lever

   Access Paper or Ask Questions

Modeling transition dynamics in MDPs with RKHS embeddings of conditional distributions

Steffen Grünewälder , Luca Baldassarre , Massimiliano Pontil , Arthur Gretton , Guy Lever

* The article can now be found under arXiv:1206.4655. We combined both versions and are withdrawing this version because of the resulting redundancy 

   Access Paper or Ask Questions

Conditional mean embeddings as regressors - supplementary

Steffen Grünewälder , Guy Lever , Luca Baldassarre , Sam Patterson , Arthur Gretton , Massimilano Pontil

   Access Paper or Ask Questions