Alert button
Picture for Joelle Pineau

Joelle Pineau

Alert button

Novelty Search in Representational Space for Sample Efficient Exploration

Add code
Bookmark button
Alert button
Oct 21, 2020
Ruo Yu Tao, Vincent François-Lavet, Joelle Pineau

Figure 1 for Novelty Search in Representational Space for Sample Efficient Exploration
Figure 2 for Novelty Search in Representational Space for Sample Efficient Exploration
Figure 3 for Novelty Search in Representational Space for Sample Efficient Exploration
Figure 4 for Novelty Search in Representational Space for Sample Efficient Exploration
Viaarxiv icon

Regularized Inverse Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 07, 2020
Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

Figure 1 for Regularized Inverse Reinforcement Learning
Figure 2 for Regularized Inverse Reinforcement Learning
Figure 3 for Regularized Inverse Reinforcement Learning
Figure 4 for Regularized Inverse Reinforcement Learning
Viaarxiv icon

Novelty Search in representational space for sample efficient exploration

Add code
Bookmark button
Alert button
Sep 28, 2020
Ruo Yu Tao, Vincent François-Lavet, Joelle Pineau

Figure 1 for Novelty Search in representational space for sample efficient exploration
Figure 2 for Novelty Search in representational space for sample efficient exploration
Figure 3 for Novelty Search in representational space for sample efficient exploration
Figure 4 for Novelty Search in representational space for sample efficient exploration
Viaarxiv icon

Constrained Markov Decision Processes via Backward Value Functions

Add code
Bookmark button
Alert button
Aug 26, 2020
Harsh Satija, Philip Amortila, Joelle Pineau

Figure 1 for Constrained Markov Decision Processes via Backward Value Functions
Figure 2 for Constrained Markov Decision Processes via Backward Value Functions
Figure 3 for Constrained Markov Decision Processes via Backward Value Functions
Viaarxiv icon

How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics

Add code
Bookmark button
Alert button
Aug 24, 2020
Prasanna Parthasarathi, Joelle Pineau, Sarath Chandar

Figure 1 for How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Figure 2 for How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Figure 3 for How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Figure 4 for How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Viaarxiv icon

Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP

Add code
Bookmark button
Alert button
Jul 28, 2020
Amy Zhang, Shagun Sodhani, Khimya Khetarpal, Joelle Pineau

Figure 1 for Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Figure 2 for Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Figure 3 for Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Figure 4 for Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Viaarxiv icon

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

Add code
Bookmark button
Alert button
Jul 06, 2020
Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon, Joelle Pineau

Figure 1 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 2 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 3 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 4 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Viaarxiv icon

Deep interpretability for GWAS

Add code
Bookmark button
Alert button
Jul 03, 2020
Deepak Sharma, Audrey Durand, Marc-André Legault, Louis-Philippe Lemieux Perreault, Audrey Lemaçon, Marie-Pierre Dubé, Joelle Pineau

Figure 1 for Deep interpretability for GWAS
Figure 2 for Deep interpretability for GWAS
Figure 3 for Deep interpretability for GWAS
Viaarxiv icon

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

Add code
Bookmark button
Alert button
Jun 23, 2020
Paul Barde, Julien Roy, Wonseok Jeon, Joelle Pineau, Christopher Pal, Derek Nowrouzezahrai

Figure 1 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Figure 2 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Figure 3 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Figure 4 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Viaarxiv icon