Alert button
Picture for Jean-Baptiste Gaya

Jean-Baptiste Gaya

Alert button

WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models

Add code
Bookmark button
Alert button
Nov 27, 2023
Youssef Benchekroun, Megi Dervishi, Mark Ibrahim, Jean-Baptiste Gaya, Xavier Martinet, Grégoire Mialon, Thomas Scialom, Emmanuel Dupoux, Dieuwke Hupkes, Pascal Vincent

Viaarxiv icon

Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards

Add code
Bookmark button
Alert button
Jun 07, 2023
Alexandre Rame, Guillaume Couairon, Mustafa Shukor, Corentin Dancette, Jean-Baptiste Gaya, Laure Soulier, Matthieu Cord

Figure 1 for Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
Figure 2 for Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
Figure 3 for Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
Figure 4 for Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
Viaarxiv icon

Building a Subspace of Policies for Scalable Continual Learning

Add code
Bookmark button
Alert button
Nov 18, 2022
Jean-Baptiste Gaya, Thang Doan, Lucas Caccia, Laure Soulier, Ludovic Denoyer, Roberta Raileanu

Figure 1 for Building a Subspace of Policies for Scalable Continual Learning
Figure 2 for Building a Subspace of Policies for Scalable Continual Learning
Figure 3 for Building a Subspace of Policies for Scalable Continual Learning
Figure 4 for Building a Subspace of Policies for Scalable Continual Learning
Viaarxiv icon

SaLinA: Sequential Learning of Agents

Add code
Bookmark button
Alert button
Oct 15, 2021
Ludovic Denoyer, Alfredo de la Fuente, Song Duong, Jean-Baptiste Gaya, Pierre-Alexandre Kamienny, Daniel H. Thompson

Figure 1 for SaLinA: Sequential Learning of Agents
Figure 2 for SaLinA: Sequential Learning of Agents
Figure 3 for SaLinA: Sequential Learning of Agents
Figure 4 for SaLinA: Sequential Learning of Agents
Viaarxiv icon

Learning a subspace of policies for online adaptation in Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 11, 2021
Jean-Baptiste Gaya, Laure Soulier, Ludovic Denoyer

Figure 1 for Learning a subspace of policies for online adaptation in Reinforcement Learning
Figure 2 for Learning a subspace of policies for online adaptation in Reinforcement Learning
Figure 3 for Learning a subspace of policies for online adaptation in Reinforcement Learning
Figure 4 for Learning a subspace of policies for online adaptation in Reinforcement Learning
Viaarxiv icon