Louis Castricato

Results of the NeurIPS 2023 Neural MMO Competition on Multi-task Reinforcement Learning

Aug 17, 2025

Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models

Feb 24, 2025

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought

Jan 08, 2025

Self-Directed Synthetic Dialogues and Revisions Technical Report

Jul 25, 2024

PERSONA: A Reproducible Testbed for Pluralistic Alignment

Jul 24, 2024

Suppressing Pink Elephants with Direct Principle Feedback

Feb 13, 2024

Neural MMO 2.0: A Massively Multi-task Addition to Massively Multi-agent Learning

Nov 07, 2023

Robust Preference Learning for Storytelling via Contrastive Reinforcement Learning

Oct 14, 2022

EleutherAI: Going Beyond "Open Science" to "Science in the Open"

Oct 12, 2022

Linearly Mapping from Image to Text Space

Sep 30, 2022