Picture for Bogdan Mazoure

Bogdan Mazoure

The Sandbox Environment for Generalizable Agent Research (SEGAR)

Add code
Mar 19, 2022
Figure 1 for The Sandbox Environment for Generalizable Agent Research (SEGAR)
Figure 2 for The Sandbox Environment for Generalizable Agent Research (SEGAR)
Figure 3 for The Sandbox Environment for Generalizable Agent Research (SEGAR)
Figure 4 for The Sandbox Environment for Generalizable Agent Research (SEGAR)
Viaarxiv icon

Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions

Add code
Nov 29, 2021
Figure 1 for Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Figure 2 for Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Figure 3 for Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Figure 4 for Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Viaarxiv icon

Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL

Add code
Jun 04, 2021
Figure 1 for Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Figure 2 for Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Figure 3 for Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Figure 4 for Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Viaarxiv icon

Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL

Add code
Jun 01, 2021
Figure 1 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 2 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 3 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 4 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Viaarxiv icon

A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix

Add code
Oct 07, 2020
Figure 1 for A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix
Figure 2 for A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix
Figure 3 for A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix
Figure 4 for A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix
Viaarxiv icon

Deep Reinforcement and InfoMax Learning

Add code
Jun 12, 2020
Figure 1 for Deep Reinforcement and InfoMax Learning
Figure 2 for Deep Reinforcement and InfoMax Learning
Figure 3 for Deep Reinforcement and InfoMax Learning
Figure 4 for Deep Reinforcement and InfoMax Learning
Viaarxiv icon

Provably efficient reconstruction of policy networks

Add code
Feb 07, 2020
Figure 1 for Provably efficient reconstruction of policy networks
Figure 2 for Provably efficient reconstruction of policy networks
Figure 3 for Provably efficient reconstruction of policy networks
Figure 4 for Provably efficient reconstruction of policy networks
Viaarxiv icon

Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning

Add code
Nov 22, 2019
Figure 1 for Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
Figure 2 for Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
Figure 3 for Efficient Planning under Partial Observability with Unnormalized Q Functions and Spectral Learning
Viaarxiv icon

Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning

Add code
Sep 24, 2019
Figure 1 for Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
Figure 2 for Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
Figure 3 for Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
Figure 4 for Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning
Viaarxiv icon

Learning Gaussian Graphical Models with Ordered Weighted L1 Regularization

Add code
Jun 06, 2019
Figure 1 for Learning Gaussian Graphical Models with Ordered Weighted L1 Regularization
Figure 2 for Learning Gaussian Graphical Models with Ordered Weighted L1 Regularization
Figure 3 for Learning Gaussian Graphical Models with Ordered Weighted L1 Regularization
Viaarxiv icon