Alert button
Picture for Bernardo Avila Pires

Bernardo Avila Pires

Alert button

Human Alignment of Large Language Models through Online Preference Optimisation

Mar 13, 2024
Daniele Calandriello, Daniel Guo, Remi Munos, Mark Rowland, Yunhao Tang, Bernardo Avila Pires, Pierre Harvey Richemond, Charline Le Lan, Michal Valko, Tianqi Liu, Rishabh Joshi, Zeyu Zheng, Bilal Piot

Viaarxiv icon

Understanding plasticity in neural networks

Mar 02, 2023
Clare Lyle, Zeyu Zheng, Evgenii Nikishin, Bernardo Avila Pires, Razvan Pascanu, Will Dabney

Figure 1 for Understanding plasticity in neural networks
Figure 2 for Understanding plasticity in neural networks
Figure 3 for Understanding plasticity in neural networks
Figure 4 for Understanding plasticity in neural networks
Viaarxiv icon

Hierarchical Reinforcement Learning in Complex 3D Environments

Feb 28, 2023
Bernardo Avila Pires, Feryal Behbahani, Hubert Soyer, Kyriacos Nikiforou, Thomas Keck, Satinder Singh

Figure 1 for Hierarchical Reinforcement Learning in Complex 3D Environments
Figure 2 for Hierarchical Reinforcement Learning in Complex 3D Environments
Figure 3 for Hierarchical Reinforcement Learning in Complex 3D Environments
Figure 4 for Hierarchical Reinforcement Learning in Complex 3D Environments
Viaarxiv icon

BYOL-Explore: Exploration by Bootstrapped Prediction

Jun 16, 2022
Zhaohan Daniel Guo, Shantanu Thakoor, Miruna Pîslar, Bernardo Avila Pires, Florent Altché, Corentin Tallec, Alaa Saade, Daniele Calandriello, Jean-Bastien Grill, Yunhao Tang, Michal Valko, Rémi Munos, Mohammad Gheshlaghi Azar, Bilal Piot

Figure 1 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 2 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 3 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 4 for BYOL-Explore: Exploration by Bootstrapped Prediction
Viaarxiv icon

Neural Recursive Belief States in Multi-Agent Reinforcement Learning

Feb 03, 2021
Pol Moreno, Edward Hughes, Kevin R. McKee, Bernardo Avila Pires, Théophane Weber

Figure 1 for Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Figure 2 for Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Figure 3 for Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Figure 4 for Neural Recursive Belief States in Multi-Agent Reinforcement Learning
Viaarxiv icon

Geometric Entropic Exploration

Jan 07, 2021
Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Alaa Saade, Shantanu Thakoor, Bilal Piot, Bernardo Avila Pires, Michal Valko, Thomas Mesnard, Tor Lattimore, Rémi Munos

Figure 1 for Geometric Entropic Exploration
Figure 2 for Geometric Entropic Exploration
Figure 3 for Geometric Entropic Exploration
Figure 4 for Geometric Entropic Exploration
Viaarxiv icon

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Jun 13, 2020
Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, Michal Valko

Figure 1 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 2 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 3 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 4 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Viaarxiv icon

Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning

Apr 30, 2020
Daniel Guo, Bernardo Avila Pires, Bilal Piot, Jean-bastien Grill, Florent Altché, Rémi Munos, Mohammad Gheshlaghi Azar

Figure 1 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 2 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 3 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Figure 4 for Bootstrap Latent-Predictive Representations for Multitask Reinforcement Learning
Viaarxiv icon

World Discovery Models

Mar 01, 2019
Mohammad Gheshlaghi Azar, Bilal Piot, Bernardo Avila Pires, Jean-Bastien Grill, Florent Altché, Rémi Munos

Figure 1 for World Discovery Models
Figure 2 for World Discovery Models
Figure 3 for World Discovery Models
Figure 4 for World Discovery Models
Viaarxiv icon