Picture for Jean-Bastien Grill

Jean-Bastien Grill

Black-box optimization of noisy functions with unknown smoothness

Add code
May 04, 2026
Viaarxiv icon

Planning in entropy-regularized Markov decision processes and games

Add code
Apr 21, 2026
Viaarxiv icon

Blazing the trails before beating the path: Sample-efficient Monte-Carlo planning

Add code
Apr 16, 2026
Viaarxiv icon

Gemma 2: Improving Open Language Models at a Practical Size

Add code
Aug 02, 2024
Figure 1 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 2 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 3 for Gemma 2: Improving Open Language Models at a Practical Size
Figure 4 for Gemma 2: Improving Open Language Models at a Practical Size
Viaarxiv icon

Stepping on the Edge: Curvature Aware Learning Rate Tuners

Add code
Jul 08, 2024
Figure 1 for Stepping on the Edge: Curvature Aware Learning Rate Tuners
Figure 2 for Stepping on the Edge: Curvature Aware Learning Rate Tuners
Figure 3 for Stepping on the Edge: Curvature Aware Learning Rate Tuners
Figure 4 for Stepping on the Edge: Curvature Aware Learning Rate Tuners
Viaarxiv icon

Emergent Communication: Generalization and Overfitting in Lewis Games

Add code
Sep 30, 2022
Figure 1 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 2 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 3 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 4 for Emergent Communication: Generalization and Overfitting in Lewis Games
Viaarxiv icon

BYOL-Explore: Exploration by Bootstrapped Prediction

Add code
Jun 16, 2022
Figure 1 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 2 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 3 for BYOL-Explore: Exploration by Bootstrapped Prediction
Figure 4 for BYOL-Explore: Exploration by Bootstrapped Prediction
Viaarxiv icon

Broaden Your Views for Self-Supervised Video Learning

Add code
Mar 30, 2021
Figure 1 for Broaden Your Views for Self-Supervised Video Learning
Figure 2 for Broaden Your Views for Self-Supervised Video Learning
Figure 3 for Broaden Your Views for Self-Supervised Video Learning
Figure 4 for Broaden Your Views for Self-Supervised Video Learning
Viaarxiv icon

BYOL works even without batch statistics

Add code
Oct 20, 2020
Figure 1 for BYOL works even without batch statistics
Figure 2 for BYOL works even without batch statistics
Viaarxiv icon

Monte-Carlo Tree Search as Regularized Policy Optimization

Add code
Jul 24, 2020
Figure 1 for Monte-Carlo Tree Search as Regularized Policy Optimization
Figure 2 for Monte-Carlo Tree Search as Regularized Policy Optimization
Figure 3 for Monte-Carlo Tree Search as Regularized Policy Optimization
Figure 4 for Monte-Carlo Tree Search as Regularized Policy Optimization
Viaarxiv icon