Alert button
Picture for Bogdan Mazoure

Bogdan Mazoure

Alert button

Large Language Models as Generalizable Policies for Embodied Tasks

Oct 26, 2023
Andrew Szot, Max Schwarzer, Harsh Agrawal, Bogdan Mazoure, Walter Talbott, Katherine Metcalf, Natalie Mackraz, Devon Hjelm, Alexander Toshev

Viaarxiv icon

Value function estimation using conditional diffusion models for control

Jun 09, 2023
Bogdan Mazoure, Walter Talbott, Miguel Angel Bautista, Devon Hjelm, Alexander Toshev, Josh Susskind

Figure 1 for Value function estimation using conditional diffusion models for control
Figure 2 for Value function estimation using conditional diffusion models for control
Figure 3 for Value function estimation using conditional diffusion models for control
Figure 4 for Value function estimation using conditional diffusion models for control
Viaarxiv icon

Accelerating exploration and representation learning with offline pre-training

Mar 31, 2023
Bogdan Mazoure, Jake Bruce, Doina Precup, Rob Fergus, Ankit Anand

Figure 1 for Accelerating exploration and representation learning with offline pre-training
Figure 2 for Accelerating exploration and representation learning with offline pre-training
Figure 3 for Accelerating exploration and representation learning with offline pre-training
Figure 4 for Accelerating exploration and representation learning with offline pre-training
Viaarxiv icon

Contrastive Value Learning: Implicit Models for Simple Offline RL

Nov 03, 2022
Bogdan Mazoure, Benjamin Eysenbach, Ofir Nachum, Jonathan Tompson

Figure 1 for Contrastive Value Learning: Implicit Models for Simple Offline RL
Figure 2 for Contrastive Value Learning: Implicit Models for Simple Offline RL
Figure 3 for Contrastive Value Learning: Implicit Models for Simple Offline RL
Figure 4 for Contrastive Value Learning: Implicit Models for Simple Offline RL
Viaarxiv icon

Sequential Density Estimation via NCWFAs Sequential Density Estimation via Nonlinear Continuous Weighted Finite Automata

Jun 08, 2022
Tianyu Li, Bogdan Mazoure, Guillaume Rabusseau

Figure 1 for Sequential Density Estimation via NCWFAs Sequential Density Estimation via Nonlinear Continuous Weighted Finite Automata
Viaarxiv icon

The Sandbox Environment for Generalizable Agent Research (SEGAR)

Mar 19, 2022
R Devon Hjelm, Bogdan Mazoure, Florian Golemo, Felipe Frujeri, Mihai Jalobeanu, Andrey Kolobov

Figure 1 for The Sandbox Environment for Generalizable Agent Research (SEGAR)
Figure 2 for The Sandbox Environment for Generalizable Agent Research (SEGAR)
Figure 3 for The Sandbox Environment for Generalizable Agent Research (SEGAR)
Figure 4 for The Sandbox Environment for Generalizable Agent Research (SEGAR)
Viaarxiv icon

Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions

Nov 29, 2021
Bogdan Mazoure, Ilya Kostrikov, Ofir Nachum, Jonathan Tompson

Figure 1 for Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Figure 2 for Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Figure 3 for Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Figure 4 for Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions
Viaarxiv icon

Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL

Jun 04, 2021
Bogdan Mazoure, Ahmed M. Ahmed, Patrick MacAlpine, R Devon Hjelm, Andrey Kolobov

Figure 1 for Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Figure 2 for Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Figure 3 for Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Figure 4 for Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL
Viaarxiv icon

Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL

Jun 01, 2021
Bogdan Mazoure, Paul Mineiro, Pavithra Srinath, Reza Sharifi Sedeh, Doina Precup, Adith Swaminathan

Figure 1 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 2 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 3 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 4 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Viaarxiv icon

A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix

Oct 07, 2020
Thang Doan, Mehdi Bennani, Bogdan Mazoure, Guillaume Rabusseau, Pierre Alquier

Figure 1 for A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix
Figure 2 for A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix
Figure 3 for A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix
Figure 4 for A Theoretical Analysis of Catastrophic Forgetting through the NTK Overlap Matrix
Viaarxiv icon