Alert button
Picture for Matthieu Geist

Matthieu Geist

Alert button

Large Batch Experience Replay

Add code
Bookmark button
Alert button
Oct 04, 2021
Thibault Lahire, Matthieu Geist, Emmanuel Rachelson

Figure 1 for Large Batch Experience Replay
Figure 2 for Large Batch Experience Replay
Figure 3 for Large Batch Experience Replay
Figure 4 for Large Batch Experience Replay
Viaarxiv icon

Generalization in Mean Field Games by Learning Master Policies

Add code
Bookmark button
Alert button
Sep 20, 2021
Sarah Perrin, Mathieu Laurière, Julien Pérolat, Romuald Élie, Matthieu Geist, Olivier Pietquin

Figure 1 for Generalization in Mean Field Games by Learning Master Policies
Figure 2 for Generalization in Mean Field Games by Learning Master Policies
Figure 3 for Generalization in Mean Field Games by Learning Master Policies
Figure 4 for Generalization in Mean Field Games by Learning Master Policies
Viaarxiv icon

Implicitly Regularized RL with Implicit Q-Values

Add code
Bookmark button
Alert button
Aug 16, 2021
Nino Vieillard, Marcin Andrychowicz, Anton Raichuk, Olivier Pietquin, Matthieu Geist

Figure 1 for Implicitly Regularized RL with Implicit Q-Values
Figure 2 for Implicitly Regularized RL with Implicit Q-Values
Figure 3 for Implicitly Regularized RL with Implicit Q-Values
Figure 4 for Implicitly Regularized RL with Implicit Q-Values
Viaarxiv icon

A functional mirror ascent view of policy gradient methods with function approximation

Add code
Bookmark button
Alert button
Aug 12, 2021
Sharan Vaswani, Olivier Bachem, Simone Totaro, Robert Mueller, Matthieu Geist, Marlos C. Machado, Pablo Samuel Castro, Nicolas Le Roux

Figure 1 for A functional mirror ascent view of policy gradient methods with function approximation
Figure 2 for A functional mirror ascent view of policy gradient methods with function approximation
Figure 3 for A functional mirror ascent view of policy gradient methods with function approximation
Figure 4 for A functional mirror ascent view of policy gradient methods with function approximation
Viaarxiv icon

Offline Reinforcement Learning as Anti-Exploration

Add code
Bookmark button
Alert button
Jun 11, 2021
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, Léonard Hussenot, Olivier Bachem, Olivier Pietquin, Matthieu Geist

Figure 1 for Offline Reinforcement Learning as Anti-Exploration
Figure 2 for Offline Reinforcement Learning as Anti-Exploration
Figure 3 for Offline Reinforcement Learning as Anti-Exploration
Figure 4 for Offline Reinforcement Learning as Anti-Exploration
Viaarxiv icon

There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 09, 2021
Nathan Grinsztajn, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist

Figure 1 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Figure 2 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Figure 3 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Figure 4 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Viaarxiv icon

Concave Utility Reinforcement Learning: the Mean-field Game viewpoint

Add code
Bookmark button
Alert button
Jun 09, 2021
Matthieu Geist, Julien Pérolat, Mathieu Laurière, Romuald Elie, Sarah Perrin, Olivier Bachem, Rémi Munos, Olivier Pietquin

Figure 1 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Figure 2 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Figure 3 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Figure 4 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Viaarxiv icon

What Matters for Adversarial Imitation Learning?

Add code
Bookmark button
Alert button
Jun 01, 2021
Manu Orsini, Anton Raichuk, Léonard Hussenot, Damien Vincent, Robert Dadashi, Sertan Girgin, Matthieu Geist, Olivier Bachem, Olivier Pietquin, Marcin Andrychowicz

Figure 1 for What Matters for Adversarial Imitation Learning?
Figure 2 for What Matters for Adversarial Imitation Learning?
Figure 3 for What Matters for Adversarial Imitation Learning?
Figure 4 for What Matters for Adversarial Imitation Learning?
Viaarxiv icon