Picture for Marc G. Bellemare

Marc G. Bellemare

Controlling Large Language Model Agents with Entropic Activation Steering

Add code
Jun 01, 2024
Viaarxiv icon

A Distributional Analogue to the Successor Representation

Add code
Feb 13, 2024
Viaarxiv icon

Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy

Add code
Nov 21, 2023
Figure 1 for Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy
Figure 2 for Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy
Figure 3 for Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy
Figure 4 for Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy
Viaarxiv icon

Small batch deep reinforcement learning

Add code
Oct 05, 2023
Viaarxiv icon

Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control

Add code
Sep 26, 2023
Figure 1 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Figure 2 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Figure 3 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Figure 4 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Viaarxiv icon

Bootstrapped Representations in Reinforcement Learning

Add code
Jun 16, 2023
Figure 1 for Bootstrapped Representations in Reinforcement Learning
Figure 2 for Bootstrapped Representations in Reinforcement Learning
Figure 3 for Bootstrapped Representations in Reinforcement Learning
Figure 4 for Bootstrapped Representations in Reinforcement Learning
Viaarxiv icon

The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation

Add code
May 28, 2023
Figure 1 for The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
Figure 2 for The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
Figure 3 for The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
Figure 4 for The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation
Viaarxiv icon

Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks

Add code
Apr 25, 2023
Figure 1 for Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Figure 2 for Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Figure 3 for Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Figure 4 for Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Viaarxiv icon

An Analysis of Quantile Temporal-Difference Learning

Add code
Jan 11, 2023
Figure 1 for An Analysis of Quantile Temporal-Difference Learning
Figure 2 for An Analysis of Quantile Temporal-Difference Learning
Figure 3 for An Analysis of Quantile Temporal-Difference Learning
Figure 4 for An Analysis of Quantile Temporal-Difference Learning
Viaarxiv icon

A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces

Add code
Dec 08, 2022
Figure 1 for A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Figure 2 for A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Figure 3 for A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Figure 4 for A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Viaarxiv icon