Alert button
Picture for Pierre-Luc Bacon

Pierre-Luc Bacon

Alert button

Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons

Add code
Bookmark button
Alert button
Mar 12, 2024
Simon Dufort-Labbé, Pierluca D'Oro, Evgenii Nikishin, Razvan Pascanu, Pierre-Luc Bacon, Aristide Baratin

Figure 1 for Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Figure 2 for Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Figure 3 for Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Figure 4 for Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons
Viaarxiv icon

Do Transformer World Models Give Better Policy Gradients?

Add code
Bookmark button
Alert button
Feb 11, 2024
Michel Ma, Tianwei Ni, Clement Gehring, Pierluca D'Oro, Pierre-Luc Bacon

Viaarxiv icon

Bridging State and History Representations: Understanding Self-Predictive RL

Add code
Bookmark button
Alert button
Jan 17, 2024
Tianwei Ni, Benjamin Eysenbach, Erfan Seyedsalehi, Michel Ma, Clement Gehring, Aditya Mahajan, Pierre-Luc Bacon

Viaarxiv icon

Maximum entropy GFlowNets with soft Q-learning

Add code
Bookmark button
Alert button
Dec 21, 2023
Sobhan Mohammadpour, Emmanuel Bengio, Emma Frejinger, Pierre-Luc Bacon

Viaarxiv icon

Course Correcting Koopman Representations

Add code
Bookmark button
Alert button
Oct 23, 2023
Mahan Fathi, Clement Gehring, Jonathan Pilault, David Kanaa, Pierre-Luc Bacon, Ross Goroshin

Viaarxiv icon

Motif: Intrinsic Motivation from Artificial Intelligence Feedback

Add code
Bookmark button
Alert button
Sep 29, 2023
Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff

Figure 1 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Figure 2 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Figure 3 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Figure 4 for Motif: Intrinsic Motivation from Artificial Intelligence Feedback
Viaarxiv icon

Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control

Add code
Bookmark button
Alert button
Sep 26, 2023
Nate Rahn, Pierluca D'Oro, Harley Wiltzer, Pierre-Luc Bacon, Marc G. Bellemare

Figure 1 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Figure 2 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Figure 3 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Figure 4 for Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
Viaarxiv icon

When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment

Add code
Bookmark button
Alert button
Jul 31, 2023
Tianwei Ni, Michel Ma, Benjamin Eysenbach, Pierre-Luc Bacon

Figure 1 for When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Figure 2 for When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Figure 3 for When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Figure 4 for When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment
Viaarxiv icon