Picture for Paul Michel

Paul Michel

Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement Learning

Add code
Apr 30, 2024
Viaarxiv icon

Language Evolution with Deep Learning

Add code
Mar 18, 2024
Figure 1 for Language Evolution with Deep Learning
Figure 2 for Language Evolution with Deep Learning
Figure 3 for Language Evolution with Deep Learning
Figure 4 for Language Evolution with Deep Learning
Viaarxiv icon

Gemma: Open Models Based on Gemini Research and Technology

Add code
Mar 13, 2024
Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Curriculum Learning with Adam: The Devil Is in the Wrong Details

Add code
Aug 23, 2023
Figure 1 for Curriculum Learning with Adam: The Devil Is in the Wrong Details
Figure 2 for Curriculum Learning with Adam: The Devil Is in the Wrong Details
Figure 3 for Curriculum Learning with Adam: The Devil Is in the Wrong Details
Figure 4 for Curriculum Learning with Adam: The Devil Is in the Wrong Details
Viaarxiv icon

Emergent Communication: Generalization and Overfitting in Lewis Games

Add code
Sep 30, 2022
Figure 1 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 2 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 3 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 4 for Emergent Communication: Generalization and Overfitting in Lewis Games
Viaarxiv icon

AANG: Automating Auxiliary Learning

Add code
May 27, 2022
Figure 1 for AANG: Automating Auxiliary Learning
Figure 2 for AANG: Automating Auxiliary Learning
Figure 3 for AANG: Automating Auxiliary Learning
Figure 4 for AANG: Automating Auxiliary Learning
Viaarxiv icon

Distributionally Robust Models with Parametric Likelihood Ratios

Add code
Apr 13, 2022
Figure 1 for Distributionally Robust Models with Parametric Likelihood Ratios
Figure 2 for Distributionally Robust Models with Parametric Likelihood Ratios
Figure 3 for Distributionally Robust Models with Parametric Likelihood Ratios
Figure 4 for Distributionally Robust Models with Parametric Likelihood Ratios
Viaarxiv icon

Balancing Average and Worst-case Accuracy in Multitask Learning

Add code
Oct 12, 2021
Figure 1 for Balancing Average and Worst-case Accuracy in Multitask Learning
Figure 2 for Balancing Average and Worst-case Accuracy in Multitask Learning
Figure 3 for Balancing Average and Worst-case Accuracy in Multitask Learning
Figure 4 for Balancing Average and Worst-case Accuracy in Multitask Learning
Viaarxiv icon