Picture for Stephen McAleer

Stephen McAleer

Language Models can Solve Computer Tasks

Add code
Mar 30, 2023
Figure 1 for Language Models can Solve Computer Tasks
Figure 2 for Language Models can Solve Computer Tasks
Figure 3 for Language Models can Solve Computer Tasks
Figure 4 for Language Models can Solve Computer Tasks
Viaarxiv icon

Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning

Add code
Mar 02, 2023
Viaarxiv icon

ASP: Learn a Universal Neural Solver!

Add code
Mar 01, 2023
Figure 1 for ASP: Learn a Universal Neural Solver!
Figure 2 for ASP: Learn a Universal Neural Solver!
Figure 3 for ASP: Learn a Universal Neural Solver!
Figure 4 for ASP: Learn a Universal Neural Solver!
Viaarxiv icon

Game Theoretic Rating in N-player general-sum games with Equilibria

Add code
Oct 05, 2022
Figure 1 for Game Theoretic Rating in N-player general-sum games with Equilibria
Figure 2 for Game Theoretic Rating in N-player general-sum games with Equilibria
Figure 3 for Game Theoretic Rating in N-player general-sum games with Equilibria
Figure 4 for Game Theoretic Rating in N-player general-sum games with Equilibria
Viaarxiv icon

Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks

Add code
Sep 16, 2022
Figure 1 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 2 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 3 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Figure 4 for Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks
Viaarxiv icon

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments

Add code
Jul 19, 2022
Figure 1 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 2 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 3 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Figure 4 for Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
Viaarxiv icon

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games

Add code
Jul 13, 2022
Figure 1 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 2 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 3 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Figure 4 for Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
Viaarxiv icon

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Add code
Jun 30, 2022
Figure 1 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 2 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 3 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 4 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Viaarxiv icon

ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret

Add code
Jun 08, 2022
Figure 1 for ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Figure 2 for ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Figure 3 for ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Figure 4 for ESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret
Viaarxiv icon

Learning Risk-Averse Equilibria in Multi-Agent Systems

Add code
May 30, 2022
Figure 1 for Learning Risk-Averse Equilibria in Multi-Agent Systems
Figure 2 for Learning Risk-Averse Equilibria in Multi-Agent Systems
Figure 3 for Learning Risk-Averse Equilibria in Multi-Agent Systems
Figure 4 for Learning Risk-Averse Equilibria in Multi-Agent Systems
Viaarxiv icon