Jean-Baptiste Lespiau

From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization

Feb 19, 2020
Julien Perolat, Remi Munos, Jean-Baptiste Lespiau, Shayegan Omidshafiei, Mark Rowland, Pedro Ortega, Neil Burch, Thomas Anthony, David Balduzzi, Bart De Vylder, Georgios Piliouras, Marc Lanctot, Karl Tuyls

Via arXiv

OpenSpiel: A Framework for Reinforcement Learning in Games

Oct 10, 2019
Marc Lanctot, Edward Lockhart, Jean-Baptiste Lespiau, Vinicius Zambaldi, Satyaki Upadhyay, Julien Pérolat, Sriram Srinivasan, Finbarr Timbers, Karl Tuyls, Shayegan Omidshafiei, Daniel Hennes, Dustin Morrill, Paul Muller, Timo Ewalds, Ryan Faulkner, János Kramár, Bart De Vylder, Brennan Saeta, James Bradbury, David Ding, Sebastian Borgeaud, Matthew Lai, Julian Schrittwieser, Thomas Anthony, Edward Hughes, Ivo Danihelka, Jonah Ryan-Davis

Via arXiv

Neural Replicator Dynamics

Jun 01, 2019
Shayegan Omidshafiei, Daniel Hennes, Dustin Morrill, Remi Munos, Julien Perolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Karl Tuyls

Via arXiv

Computing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent

Mar 21, 2019
Edward Lockhart, Marc Lanctot, Julien Pérolat, Jean-Baptiste Lespiau, Dustin Morrill, Finbarr Timbers, Karl Tuyls

Via arXiv

Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search

Nov 15, 2018
Lars Buesing, Theophane Weber, Yori Zwols, Sebastien Racaniere, Arthur Guez, Jean-Baptiste Lespiau, Nicolas Heess

Via arXiv