Alert button
Picture for Olivier Pietquin

Olivier Pietquin

Alert button

RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 04, 2021
Sabela Ramos, Sertan Girgin, Léonard Hussenot, Damien Vincent, Hanna Yakubovich, Daniel Toyama, Anita Gergely, Piotr Stanczyk, Raphael Marinier, Jeremiah Harmsen, Olivier Pietquin, Nikola Momchev

Figure 1 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Figure 2 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Figure 3 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Figure 4 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Viaarxiv icon

Continuous Control with Action Quantization from Demonstrations

Add code
Bookmark button
Alert button
Oct 19, 2021
Robert Dadashi, Léonard Hussenot, Damien Vincent, Sertan Girgin, Anton Raichuk, Matthieu Geist, Olivier Pietquin

Figure 1 for Continuous Control with Action Quantization from Demonstrations
Figure 2 for Continuous Control with Action Quantization from Demonstrations
Figure 3 for Continuous Control with Action Quantization from Demonstrations
Figure 4 for Continuous Control with Action Quantization from Demonstrations
Viaarxiv icon

Generalization in Mean Field Games by Learning Master Policies

Add code
Bookmark button
Alert button
Sep 20, 2021
Sarah Perrin, Mathieu Laurière, Julien Pérolat, Romuald Élie, Matthieu Geist, Olivier Pietquin

Figure 1 for Generalization in Mean Field Games by Learning Master Policies
Figure 2 for Generalization in Mean Field Games by Learning Master Policies
Figure 3 for Generalization in Mean Field Games by Learning Master Policies
Figure 4 for Generalization in Mean Field Games by Learning Master Policies
Viaarxiv icon

Learning Natural Language Generation from Scratch

Add code
Bookmark button
Alert button
Sep 20, 2021
Alice Martin Donati, Guillaume Quispe, Charles Ollion, Sylvain Le Corff, Florian Strub, Olivier Pietquin

Figure 1 for Learning Natural Language Generation from Scratch
Figure 2 for Learning Natural Language Generation from Scratch
Figure 3 for Learning Natural Language Generation from Scratch
Figure 4 for Learning Natural Language Generation from Scratch
Viaarxiv icon

Implicitly Regularized RL with Implicit Q-Values

Add code
Bookmark button
Alert button
Aug 16, 2021
Nino Vieillard, Marcin Andrychowicz, Anton Raichuk, Olivier Pietquin, Matthieu Geist

Figure 1 for Implicitly Regularized RL with Implicit Q-Values
Figure 2 for Implicitly Regularized RL with Implicit Q-Values
Figure 3 for Implicitly Regularized RL with Implicit Q-Values
Figure 4 for Implicitly Regularized RL with Implicit Q-Values
Viaarxiv icon

Offline Reinforcement Learning as Anti-Exploration

Add code
Bookmark button
Alert button
Jun 11, 2021
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, Léonard Hussenot, Olivier Bachem, Olivier Pietquin, Matthieu Geist

Figure 1 for Offline Reinforcement Learning as Anti-Exploration
Figure 2 for Offline Reinforcement Learning as Anti-Exploration
Figure 3 for Offline Reinforcement Learning as Anti-Exploration
Figure 4 for Offline Reinforcement Learning as Anti-Exploration
Viaarxiv icon

There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 09, 2021
Nathan Grinsztajn, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist

Figure 1 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Figure 2 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Figure 3 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Figure 4 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Viaarxiv icon

Concave Utility Reinforcement Learning: the Mean-field Game viewpoint

Add code
Bookmark button
Alert button
Jun 09, 2021
Matthieu Geist, Julien Pérolat, Mathieu Laurière, Romuald Elie, Sarah Perrin, Olivier Bachem, Rémi Munos, Olivier Pietquin

Figure 1 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Figure 2 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Figure 3 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Figure 4 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Viaarxiv icon