Alert button
Picture for Maxime Chevalier-Boisvert

Maxime Chevalier-Boisvert

Alert button

Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks

Add code
Bookmark button
Alert button
Jun 24, 2023
Maxime Chevalier-Boisvert, Bolun Dai, Mark Towers, Rodrigo de Lazcano, Lucas Willems, Salem Lahlou, Suman Pal, Pablo Samuel Castro, Jordan Terry

Figure 1 for Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks
Figure 2 for Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks
Figure 3 for Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks
Figure 4 for Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks
Viaarxiv icon

DeepDrummer : Generating Drum Loops using Deep Learning and a Human in the Loop

Add code
Bookmark button
Alert button
Aug 26, 2020
Guillaume Alain, Maxime Chevalier-Boisvert, Frederic Osterrath, Remi Piche-Taillefer

Figure 1 for DeepDrummer : Generating Drum Loops using Deep Learning and a Human in the Loop
Figure 2 for DeepDrummer : Generating Drum Loops using Deep Learning and a Human in the Loop
Figure 3 for DeepDrummer : Generating Drum Loops using Deep Learning and a Human in the Loop
Figure 4 for DeepDrummer : Generating Drum Loops using Deep Learning and a Human in the Loop
Viaarxiv icon

BabyAI 1.1

Add code
Bookmark button
Alert button
Jul 24, 2020
David Yu-Tung Hui, Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Yoshua Bengio

Figure 1 for BabyAI 1.1
Figure 2 for BabyAI 1.1
Figure 3 for BabyAI 1.1
Figure 4 for BabyAI 1.1
Viaarxiv icon

Combating False Negatives in Adversarial Imitation Learning

Add code
Bookmark button
Alert button
Feb 02, 2020
Konrad Zolna, Chitwan Saharia, Leonard Boussioux, David Yu-Tung Hui, Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Yoshua Bengio

Figure 1 for Combating False Negatives in Adversarial Imitation Learning
Figure 2 for Combating False Negatives in Adversarial Imitation Learning
Figure 3 for Combating False Negatives in Adversarial Imitation Learning
Figure 4 for Combating False Negatives in Adversarial Imitation Learning
Viaarxiv icon

Option-critic in cooperative multi-agent systems

Add code
Bookmark button
Alert button
Jan 06, 2020
Jhelum Chakravorty, Nadeem Ward, Julien Roy, Maxime Chevalier-Boisvert, Sumana Basu, Andrei Lupu, Doina Precup

Figure 1 for Option-critic in cooperative multi-agent systems
Figure 2 for Option-critic in cooperative multi-agent systems
Figure 3 for Option-critic in cooperative multi-agent systems
Figure 4 for Option-critic in cooperative multi-agent systems
Viaarxiv icon

Options of Interest: Temporal Abstraction with Interest Functions

Add code
Bookmark button
Alert button
Jan 01, 2020
Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert, Pierre-Luc Bacon, Doina Precup

Figure 1 for Options of Interest: Temporal Abstraction with Interest Functions
Figure 2 for Options of Interest: Temporal Abstraction with Interest Functions
Figure 3 for Options of Interest: Temporal Abstraction with Interest Functions
Figure 4 for Options of Interest: Temporal Abstraction with Interest Functions
Viaarxiv icon

Automated curriculum generation for Policy Gradients from Demonstrations

Add code
Bookmark button
Alert button
Dec 01, 2019
Anirudh Srinivasan, Dzmitry Bahdanau, Maxime Chevalier-Boisvert, Yoshua Bengio

Figure 1 for Automated curriculum generation for Policy Gradients from Demonstrations
Figure 2 for Automated curriculum generation for Policy Gradients from Demonstrations
Figure 3 for Automated curriculum generation for Policy Gradients from Demonstrations
Figure 4 for Automated curriculum generation for Policy Gradients from Demonstrations
Viaarxiv icon

Robo-PlaNet: Learning to Poke in a Day

Add code
Bookmark button
Alert button
Nov 19, 2019
Maxime Chevalier-Boisvert, Guillaume Alain, Florian Golemo, Derek Nowrouzezahrai

Figure 1 for Robo-PlaNet: Learning to Poke in a Day
Figure 2 for Robo-PlaNet: Learning to Poke in a Day
Figure 3 for Robo-PlaNet: Learning to Poke in a Day
Viaarxiv icon