Picture for Pierre-Luc Bacon

Pierre-Luc Bacon

Decoupling regularization from the action space

Add code
Jun 10, 2024
Viaarxiv icon

Generative Active Learning for the Search of Small-molecule Protein Binders

Add code
May 02, 2024
Viaarxiv icon

Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons

Add code
Mar 12, 2024
Figure 1 for Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Figure 2 for Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Figure 3 for Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Figure 4 for Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Viaarxiv icon

Do Transformer World Models Give Better Policy Gradients?

Add code
Feb 11, 2024
Figure 1 for Do Transformer World Models Give Better Policy Gradients?
Figure 2 for Do Transformer World Models Give Better Policy Gradients?
Figure 3 for Do Transformer World Models Give Better Policy Gradients?
Figure 4 for Do Transformer World Models Give Better Policy Gradients?
Viaarxiv icon

Bridging State and History Representations: Understanding Self-Predictive RL

Add code
Jan 17, 2024
Figure 1 for Bridging State and History Representations: Understanding Self-Predictive RL
Figure 2 for Bridging State and History Representations: Understanding Self-Predictive RL
Figure 3 for Bridging State and History Representations: Understanding Self-Predictive RL
Figure 4 for Bridging State and History Representations: Understanding Self-Predictive RL
Viaarxiv icon

Maximum entropy GFlowNets with soft Q-learning

Add code
Dec 21, 2023
Viaarxiv icon

Course Correcting Koopman Representations

Add code
Oct 23, 2023
Figure 1 for Course Correcting Koopman Representations
Figure 2 for Course Correcting Koopman Representations
Figure 3 for Course Correcting Koopman Representations
Figure 4 for Course Correcting Koopman Representations
Viaarxiv icon

Motif: Intrinsic Motivation from Artificial Intelligence Feedback

Add code
Sep 29, 2023
Figure 1 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Figure 2 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Figure 3 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Figure 4 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Viaarxiv icon

Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control

Add code
Sep 26, 2023
Figure 1 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Figure 2 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Figure 3 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Figure 4 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Viaarxiv icon

When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment

Add code
Jul 31, 2023
Figure 1 for When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Figure 2 for When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Figure 3 for When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Figure 4 for When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Viaarxiv icon