Alert button
Picture for Stephen McAleer

Stephen McAleer

Alert button

Language Models can Solve Computer Tasks

Add code
Bookmark button
Alert button
Mar 30, 2023
Geunwoo Kim, Pierre Baldi, Stephen McAleer

Figure 1 for Language Models can Solve Computer Tasks
Figure 2 for Language Models can Solve Computer Tasks
Figure 3 for Language Models can Solve Computer Tasks
Figure 4 for Language Models can Solve Computer Tasks
Viaarxiv icon

Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 02, 2023
Lukas Schäfer, Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V. Albrecht, David Mguni

Figure 1 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Figure 2 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Figure 3 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Figure 4 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Viaarxiv icon

ASP: Learn a Universal Neural Solver!

Add code
Bookmark button
Alert button
Mar 01, 2023
Chenguang Wang, Zhouliang Yu, Stephen McAleer, Tianshu Yu, Yaodong Yang

Figure 1 for ASP: Learn a Universal Neural Solver!
Figure 2 for ASP: Learn a Universal Neural Solver!
Figure 3 for ASP: Learn a Universal Neural Solver!
Figure 4 for ASP: Learn a Universal Neural Solver!
Viaarxiv icon

Game Theoretic Rating in N-player general-sum games with Equilibria

Add code
Bookmark button
Alert button
Oct 05, 2022
Luke Marris, Marc Lanctot, Ian Gemp, Shayegan Omidshafiei, Stephen McAleer, Jerome Connor, Karl Tuyls, Thore Graepel

Figure 1 for Game Theoretic Rating in N-player general-sum games with Equilibria
Figure 2 for Game Theoretic Rating in N-player general-sum games with Equilibria
Figure 3 for Game Theoretic Rating in N-player general-sum games with Equilibria
Figure 4 for Game Theoretic Rating in N-player general-sum games with Equilibria
Viaarxiv icon

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

Add code
Bookmark button
Alert button
Sep 16, 2022
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox

Figure 1 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 2 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 3 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 4 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Viaarxiv icon

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments

Add code
Bookmark button
Alert button
Jul 19, 2022
JB Lanier, Stephen McAleer, Pierre Baldi, Roy Fox

Figure 1 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 2 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 3 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 4 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Viaarxiv icon

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games

Add code
Bookmark button
Alert button
Jul 13, 2022
Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, Roy Fox, Tuomas Sandholm

Figure 1 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 2 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 3 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 4 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Viaarxiv icon

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 30, 2022
Julien Perolat, Bart de Vylder, Daniel Hennes, Eugene Tarassov, Florian Strub, Vincent de Boer, Paul Muller, Jerome T. Connor, Neil Burch, Thomas Anthony, Stephen McAleer, Romuald Elie, Sarah H. Cen, Zhe Wang, Audrunas Gruslys, Aleksandra Malysheva, Mina Khan, Sherjil Ozair, Finbarr Timbers, Toby Pohlen, Tom Eccles, Mark Rowland, Marc Lanctot, Jean-Baptiste Lespiau, Bilal Piot, Shayegan Omidshafiei, Edward Lockhart, Laurent Sifre, Nathalie Beauguerlange, Remi Munos, David Silver, Satinder Singh, Demis Hassabis, Karl Tuyls

Figure 1 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 2 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 3 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 4 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Viaarxiv icon