Picture for Olivier Pietquin

Olivier Pietquin

Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act

Add code
Mar 16, 2022
Figure 1 for Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act
Figure 2 for Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act
Figure 3 for Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act
Figure 4 for Lazy-MDPs: Towards Interpretable Reinforcement Learning by Learning When to Act
Viaarxiv icon

RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning

Add code
Nov 04, 2021
Figure 1 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Figure 2 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Figure 3 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Figure 4 for RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning
Viaarxiv icon

Continuous Control with Action Quantization from Demonstrations

Add code
Oct 19, 2021
Figure 1 for Continuous Control with Action Quantization from Demonstrations
Figure 2 for Continuous Control with Action Quantization from Demonstrations
Figure 3 for Continuous Control with Action Quantization from Demonstrations
Figure 4 for Continuous Control with Action Quantization from Demonstrations
Viaarxiv icon

Generalization in Mean Field Games by Learning Master Policies

Add code
Sep 20, 2021
Figure 1 for Generalization in Mean Field Games by Learning Master Policies
Figure 2 for Generalization in Mean Field Games by Learning Master Policies
Figure 3 for Generalization in Mean Field Games by Learning Master Policies
Figure 4 for Generalization in Mean Field Games by Learning Master Policies
Viaarxiv icon

Learning Natural Language Generation from Scratch

Add code
Sep 20, 2021
Figure 1 for Learning Natural Language Generation from Scratch
Figure 2 for Learning Natural Language Generation from Scratch
Figure 3 for Learning Natural Language Generation from Scratch
Figure 4 for Learning Natural Language Generation from Scratch
Viaarxiv icon

Implicitly Regularized RL with Implicit Q-Values

Add code
Aug 16, 2021
Figure 1 for Implicitly Regularized RL with Implicit Q-Values
Figure 2 for Implicitly Regularized RL with Implicit Q-Values
Figure 3 for Implicitly Regularized RL with Implicit Q-Values
Figure 4 for Implicitly Regularized RL with Implicit Q-Values
Viaarxiv icon

Offline Reinforcement Learning as Anti-Exploration

Add code
Jun 11, 2021
Figure 1 for Offline Reinforcement Learning as Anti-Exploration
Figure 2 for Offline Reinforcement Learning as Anti-Exploration
Figure 3 for Offline Reinforcement Learning as Anti-Exploration
Figure 4 for Offline Reinforcement Learning as Anti-Exploration
Viaarxiv icon

There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning

Add code
Jun 09, 2021
Figure 1 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Figure 2 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Figure 3 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Figure 4 for There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Viaarxiv icon

Concave Utility Reinforcement Learning: the Mean-field Game viewpoint

Add code
Jun 09, 2021
Figure 1 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Figure 2 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Figure 3 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Figure 4 for Concave Utility Reinforcement Learning: the Mean-field Game viewpoint
Viaarxiv icon

What Matters for Adversarial Imitation Learning?

Add code
Jun 01, 2021
Figure 1 for What Matters for Adversarial Imitation Learning?
Figure 2 for What Matters for Adversarial Imitation Learning?
Figure 3 for What Matters for Adversarial Imitation Learning?
Figure 4 for What Matters for Adversarial Imitation Learning?
Viaarxiv icon