Picture for Charline Le Lan

Charline Le Lan

Human Alignment of Large Language Models through Online Preference Optimisation

Add code
Mar 13, 2024
Figure 1 for Human Alignment of Large Language Models through Online Preference Optimisation
Figure 2 for Human Alignment of Large Language Models through Online Preference Optimisation
Figure 3 for Human Alignment of Large Language Models through Online Preference Optimisation
Figure 4 for Human Alignment of Large Language Models through Online Preference Optimisation
Viaarxiv icon

Gemma: Open Models Based on Gemini Research and Technology

Add code
Mar 13, 2024
Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Bootstrapped Representations in Reinforcement Learning

Add code
Jun 16, 2023
Figure 1 for Bootstrapped Representations in Reinforcement Learning
Figure 2 for Bootstrapped Representations in Reinforcement Learning
Figure 3 for Bootstrapped Representations in Reinforcement Learning
Figure 4 for Bootstrapped Representations in Reinforcement Learning
Viaarxiv icon

Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks

Add code
Apr 25, 2023
Figure 1 for Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Figure 2 for Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Figure 3 for Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Figure 4 for Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks
Viaarxiv icon

A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces

Add code
Dec 08, 2022
Figure 1 for A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Figure 2 for A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Figure 3 for A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Figure 4 for A Novel Stochastic Gradient Descent Algorithm for Learning Principal Subspaces
Viaarxiv icon

Understanding Self-Predictive Learning for Reinforcement Learning

Add code
Dec 06, 2022
Figure 1 for Understanding Self-Predictive Learning for Reinforcement Learning
Figure 2 for Understanding Self-Predictive Learning for Reinforcement Learning
Figure 3 for Understanding Self-Predictive Learning for Reinforcement Learning
Figure 4 for Understanding Self-Predictive Learning for Reinforcement Learning
Viaarxiv icon

On the Generalization of Representations in Reinforcement Learning

Add code
Mar 01, 2022
Figure 1 for On the Generalization of Representations in Reinforcement Learning
Figure 2 for On the Generalization of Representations in Reinforcement Learning
Figure 3 for On the Generalization of Representations in Reinforcement Learning
Figure 4 for On the Generalization of Representations in Reinforcement Learning
Viaarxiv icon

Metrics and continuity in reinforcement learning

Add code
Feb 02, 2021
Figure 1 for Metrics and continuity in reinforcement learning
Figure 2 for Metrics and continuity in reinforcement learning
Figure 3 for Metrics and continuity in reinforcement learning
Figure 4 for Metrics and continuity in reinforcement learning
Viaarxiv icon