Picture for Damien Vincent

Damien Vincent

Google Research Football: A Novel Reinforcement Learning Environment

Add code
Jul 25, 2019
Figure 1 for Google Research Football: A Novel Reinforcement Learning Environment
Figure 2 for Google Research Football: A Novel Reinforcement Learning Environment
Figure 3 for Google Research Football: A Novel Reinforcement Learning Environment
Figure 4 for Google Research Football: A Novel Reinforcement Learning Environment
Viaarxiv icon

MULEX: Disentangling Exploitation from Exploration in Deep RL

Add code
Jul 01, 2019
Figure 1 for MULEX: Disentangling Exploitation from Exploration in Deep RL
Figure 2 for MULEX: Disentangling Exploitation from Exploration in Deep RL
Figure 3 for MULEX: Disentangling Exploitation from Exploration in Deep RL
Figure 4 for MULEX: Disentangling Exploitation from Exploration in Deep RL
Viaarxiv icon

Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates

Add code
Jun 19, 2019
Figure 1 for Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
Figure 2 for Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
Figure 3 for Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
Figure 4 for Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates
Viaarxiv icon

Episodic Curiosity through Reachability

Add code
Feb 22, 2019
Figure 1 for Episodic Curiosity through Reachability
Figure 2 for Episodic Curiosity through Reachability
Figure 3 for Episodic Curiosity through Reachability
Figure 4 for Episodic Curiosity through Reachability
Viaarxiv icon

Clustering Meets Implicit Generative Models

Add code
Aug 02, 2018
Figure 1 for Clustering Meets Implicit Generative Models
Figure 2 for Clustering Meets Implicit Generative Models
Figure 3 for Clustering Meets Implicit Generative Models
Figure 4 for Clustering Meets Implicit Generative Models
Viaarxiv icon

Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem

Add code
Jul 09, 2018
Figure 1 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Figure 2 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Figure 3 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Figure 4 for Temporal Difference Learning with Neural Networks - Study of the Leakage Propagation Problem
Viaarxiv icon

Spatially adaptive image compression using a tiled deep network

Add code
Feb 07, 2018
Figure 1 for Spatially adaptive image compression using a tiled deep network
Figure 2 for Spatially adaptive image compression using a tiled deep network
Figure 3 for Spatially adaptive image compression using a tiled deep network
Figure 4 for Spatially adaptive image compression using a tiled deep network
Viaarxiv icon

Full Resolution Image Compression with Recurrent Neural Networks

Add code
Jul 07, 2017
Figure 1 for Full Resolution Image Compression with Recurrent Neural Networks
Figure 2 for Full Resolution Image Compression with Recurrent Neural Networks
Figure 3 for Full Resolution Image Compression with Recurrent Neural Networks
Figure 4 for Full Resolution Image Compression with Recurrent Neural Networks
Viaarxiv icon

Toward Optimal Run Racing: Application to Deep Learning Calibration

Add code
Jun 20, 2017
Figure 1 for Toward Optimal Run Racing: Application to Deep Learning Calibration
Figure 2 for Toward Optimal Run Racing: Application to Deep Learning Calibration
Figure 3 for Toward Optimal Run Racing: Application to Deep Learning Calibration
Figure 4 for Toward Optimal Run Racing: Application to Deep Learning Calibration
Viaarxiv icon

Critical Hyper-Parameters: No Random, No Cry

Add code
Jun 10, 2017
Figure 1 for Critical Hyper-Parameters: No Random, No Cry
Figure 2 for Critical Hyper-Parameters: No Random, No Cry
Figure 3 for Critical Hyper-Parameters: No Random, No Cry
Figure 4 for Critical Hyper-Parameters: No Random, No Cry
Viaarxiv icon