Picture for Joelle Pineau

Joelle Pineau

Editors

Novelty Search in Representational Space for Sample Efficient Exploration

Add code
Oct 21, 2020
Figure 1 for Novelty Search in Representational Space for Sample Efficient Exploration
Figure 2 for Novelty Search in Representational Space for Sample Efficient Exploration
Figure 3 for Novelty Search in Representational Space for Sample Efficient Exploration
Figure 4 for Novelty Search in Representational Space for Sample Efficient Exploration
Viaarxiv icon

Regularized Inverse Reinforcement Learning

Add code
Oct 07, 2020
Figure 1 for Regularized Inverse Reinforcement Learning
Figure 2 for Regularized Inverse Reinforcement Learning
Figure 3 for Regularized Inverse Reinforcement Learning
Figure 4 for Regularized Inverse Reinforcement Learning
Viaarxiv icon

Constrained Markov Decision Processes via Backward Value Functions

Add code
Aug 26, 2020
Figure 1 for Constrained Markov Decision Processes via Backward Value Functions
Figure 2 for Constrained Markov Decision Processes via Backward Value Functions
Figure 3 for Constrained Markov Decision Processes via Backward Value Functions
Viaarxiv icon

How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics

Add code
Aug 24, 2020
Figure 1 for How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Figure 2 for How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Figure 3 for How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Figure 4 for How To Evaluate Your Dialogue System: Probe Tasks as an Alternative for Token-level Evaluation Metrics
Viaarxiv icon

Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP

Add code
Jul 28, 2020
Figure 1 for Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Figure 2 for Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Figure 3 for Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Figure 4 for Multi-Task Reinforcement Learning as a Hidden-Parameter Block MDP
Viaarxiv icon

TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

Add code
Jul 06, 2020
Figure 1 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 2 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 3 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Figure 4 for TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?
Viaarxiv icon

Deep interpretability for GWAS

Add code
Jul 03, 2020
Figure 1 for Deep interpretability for GWAS
Figure 2 for Deep interpretability for GWAS
Figure 3 for Deep interpretability for GWAS
Viaarxiv icon

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

Add code
Jun 23, 2020
Figure 1 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Figure 2 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Figure 3 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Figure 4 for Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Viaarxiv icon

Automated Personalized Feedback Improves Learning Gains in an Intelligent Tutoring System

Add code
May 07, 2020
Figure 1 for Automated Personalized Feedback Improves Learning Gains in an Intelligent Tutoring System
Figure 2 for Automated Personalized Feedback Improves Learning Gains in an Intelligent Tutoring System
Viaarxiv icon

Plan2Vec: Unsupervised Representation Learning by Latent Plans

Add code
May 07, 2020
Figure 1 for Plan2Vec: Unsupervised Representation Learning by Latent Plans
Figure 2 for Plan2Vec: Unsupervised Representation Learning by Latent Plans
Figure 3 for Plan2Vec: Unsupervised Representation Learning by Latent Plans
Figure 4 for Plan2Vec: Unsupervised Representation Learning by Latent Plans
Viaarxiv icon