Alert button
Picture for Joelle Pineau

Joelle Pineau

Alert button

Stable Policy Optimization via Off-Policy Divergence Regularization

Add code
Bookmark button
Alert button
Mar 09, 2020
Ahmed Touati, Amy Zhang, Joelle Pineau, Pascal Vincent

Figure 1 for Stable Policy Optimization via Off-Policy Divergence Regularization
Figure 2 for Stable Policy Optimization via Off-Policy Divergence Regularization
Figure 3 for Stable Policy Optimization via Off-Policy Divergence Regularization
Figure 4 for Stable Policy Optimization via Off-Policy Divergence Regularization
Viaarxiv icon

Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic

Add code
Bookmark button
Alert button
Feb 24, 2020
Wonseok Jeon, Paul Barde, Derek Nowrouzezahrai, Joelle Pineau

Figure 1 for Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
Figure 2 for Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
Figure 3 for Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
Figure 4 for Scalable Multi-Agent Inverse Reinforcement Learning via Actor-Attention-Critic
Viaarxiv icon

Provably efficient reconstruction of policy networks

Add code
Bookmark button
Alert button
Feb 07, 2020
Bogdan Mazoure, Thang Doan, Tianyu Li, Vladimir Makarenkov, Joelle Pineau, Doina Precup, Guillaume Rabusseau

Figure 1 for Provably efficient reconstruction of policy networks
Figure 2 for Provably efficient reconstruction of policy networks
Figure 3 for Provably efficient reconstruction of policy networks
Figure 4 for Provably efficient reconstruction of policy networks
Viaarxiv icon

On the interaction between supervision and self-play in emergent communication

Add code
Bookmark button
Alert button
Feb 04, 2020
Ryan Lowe, Abhinav Gupta, Jakob Foerster, Douwe Kiela, Joelle Pineau

Figure 1 for On the interaction between supervision and self-play in emergent communication
Figure 2 for On the interaction between supervision and self-play in emergent communication
Figure 3 for On the interaction between supervision and self-play in emergent communication
Figure 4 for On the interaction between supervision and self-play in emergent communication
Viaarxiv icon

Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning

Add code
Bookmark button
Alert button
Jan 31, 2020
Peter Henderson, Jieru Hu, Joshua Romoff, Emma Brunskill, Dan Jurafsky, Joelle Pineau

Figure 1 for Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Figure 2 for Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Figure 3 for Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Figure 4 for Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning
Viaarxiv icon

Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift

Add code
Bookmark button
Alert button
Dec 01, 2019
Riashat Islam, Komal K. Teru, Deepak Sharma, Joelle Pineau

Figure 1 for Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift
Figure 2 for Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift
Figure 3 for Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift
Figure 4 for Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift
Viaarxiv icon

Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking

Add code
Bookmark button
Alert button
Nov 20, 2019
Eric Crawford, Joelle Pineau

Figure 1 for Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking
Figure 2 for Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking
Figure 3 for Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking
Figure 4 for Exploiting Spatial Invariance for Scalable Unsupervised Object Tracking
Viaarxiv icon

Online Learned Continual Compression with Stacked Quantization Module

Add code
Bookmark button
Alert button
Nov 19, 2019
Lucas Caccia, Eugene Belilovsky, Massimo Caccia, Joelle Pineau

Figure 1 for Online Learned Continual Compression with Stacked Quantization Module
Figure 2 for Online Learned Continual Compression with Stacked Quantization Module
Figure 3 for Online Learned Continual Compression with Stacked Quantization Module
Figure 4 for Online Learned Continual Compression with Stacked Quantization Module
Viaarxiv icon

MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions

Add code
Bookmark button
Alert button
Oct 30, 2019
Viswanath Sivakumar, Tim Rocktäschel, Alexander H. Miller, Heinrich Küttler, Nantas Nardelli, Mike Rabbat, Joelle Pineau, Sebastian Riedel

Figure 1 for MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions
Figure 2 for MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions
Figure 3 for MVFST-RL: An Asynchronous RL Framework for Congestion Control with Delayed Actions
Viaarxiv icon