Alert button
Picture for John Schultz

John Schultz

Alert button

Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 02, 2023
Marc Lanctot, John Schultz, Neil Burch, Max Olan Smith, Daniel Hennes, Thomas Anthony, Julien Perolat

Figure 1 for Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
Figure 2 for Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
Figure 3 for Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
Figure 4 for Population-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning
Viaarxiv icon

Learning to Navigate Wikipedia by Taking Random Walks

Add code
Bookmark button
Alert button
Oct 31, 2022
Manzil Zaheer, Kenneth Marino, Will Grathwohl, John Schultz, Wendy Shang, Sheila Babayan, Arun Ahuja, Ishita Dasgupta, Christine Kaeser-Chen, Rob Fergus

Figure 1 for Learning to Navigate Wikipedia by Taking Random Walks
Figure 2 for Learning to Navigate Wikipedia by Taking Random Walks
Figure 3 for Learning to Navigate Wikipedia by Taking Random Walks
Figure 4 for Learning to Navigate Wikipedia by Taking Random Walks
Viaarxiv icon

The Advantage Regret-Matching Actor-Critic

Add code
Bookmark button
Alert button
Aug 27, 2020
Audrūnas Gruslys, Marc Lanctot, Rémi Munos, Finbarr Timbers, Martin Schmid, Julien Perolat, Dustin Morrill, Vinicius Zambaldi, Jean-Baptiste Lespiau, John Schultz, Mohammad Gheshlaghi Azar, Michael Bowling, Karl Tuyls

Figure 1 for The Advantage Regret-Matching Actor-Critic
Figure 2 for The Advantage Regret-Matching Actor-Critic
Figure 3 for The Advantage Regret-Matching Actor-Critic
Figure 4 for The Advantage Regret-Matching Actor-Critic
Viaarxiv icon