Picture for Joelle Pineau

Joelle Pineau

Editors

A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions

Add code
Jan 05, 2022
Figure 1 for A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
Figure 2 for A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
Figure 3 for A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
Figure 4 for A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
Viaarxiv icon

Block Contextual MDPs for Continual Learning

Add code
Oct 13, 2021
Figure 1 for Block Contextual MDPs for Continual Learning
Figure 2 for Block Contextual MDPs for Continual Learning
Figure 3 for Block Contextual MDPs for Continual Learning
Figure 4 for Block Contextual MDPs for Continual Learning
Viaarxiv icon

OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation

Add code
Jun 21, 2021
Figure 1 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Figure 2 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Figure 3 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Figure 4 for OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation
Viaarxiv icon

Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?

Add code
Jun 20, 2021
Figure 1 for Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?
Figure 2 for Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?
Figure 3 for Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?
Figure 4 for Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?
Viaarxiv icon

A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss

Add code
Jun 20, 2021
Figure 1 for A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss
Figure 2 for A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss
Figure 3 for A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss
Figure 4 for A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss
Viaarxiv icon

SPeCiaL: Self-Supervised Pretraining for Continual Learning

Add code
Jun 16, 2021
Figure 1 for SPeCiaL: Self-Supervised Pretraining for Continual Learning
Figure 2 for SPeCiaL: Self-Supervised Pretraining for Continual Learning
Figure 3 for SPeCiaL: Self-Supervised Pretraining for Continual Learning
Figure 4 for SPeCiaL: Self-Supervised Pretraining for Continual Learning
Viaarxiv icon

Correcting Momentum in Temporal Difference Learning

Add code
Jun 07, 2021
Figure 1 for Correcting Momentum in Temporal Difference Learning
Figure 2 for Correcting Momentum in Temporal Difference Learning
Figure 3 for Correcting Momentum in Temporal Difference Learning
Figure 4 for Correcting Momentum in Temporal Difference Learning
Viaarxiv icon

Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs

Add code
May 31, 2021
Figure 1 for Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs
Figure 2 for Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs
Figure 3 for Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs
Figure 4 for Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs
Viaarxiv icon

Sometimes We Want Translationese

Add code
Apr 15, 2021
Figure 1 for Sometimes We Want Translationese
Figure 2 for Sometimes We Want Translationese
Figure 3 for Sometimes We Want Translationese
Figure 4 for Sometimes We Want Translationese
Viaarxiv icon

Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little

Add code
Apr 14, 2021
Figure 1 for Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Figure 2 for Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Figure 3 for Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Figure 4 for Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Viaarxiv icon