Picture for Nathan Lambert

Nathan Lambert

RewardBench: Evaluating Reward Models for Language Modeling

Add code
Mar 20, 2024
Viaarxiv icon

A Survey on Data Selection for Language Models

Add code
Mar 08, 2024
Viaarxiv icon

OLMo: Accelerating the Science of Language Models

Add code
Feb 07, 2024
Figure 1 for OLMo: Accelerating the Science of Language Models
Figure 2 for OLMo: Accelerating the Science of Language Models
Figure 3 for OLMo: Accelerating the Science of Language Models
Figure 4 for OLMo: Accelerating the Science of Language Models
Viaarxiv icon

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Add code
Jan 31, 2024
Figure 1 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 2 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 3 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Figure 4 for Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
Viaarxiv icon

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Add code
Nov 20, 2023
Viaarxiv icon

The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from Human Feedback

Add code
Oct 31, 2023
Viaarxiv icon

Zephyr: Direct Distillation of LM Alignment

Add code
Oct 25, 2023
Viaarxiv icon

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

Add code
Oct 10, 2023
Viaarxiv icon

BLISS: Interplanetary Exploration with Swarms of Low-Cost Spacecraft

Add code
Jul 20, 2023
Figure 1 for BLISS: Interplanetary Exploration with Swarms of Low-Cost Spacecraft
Figure 2 for BLISS: Interplanetary Exploration with Swarms of Low-Cost Spacecraft
Figure 3 for BLISS: Interplanetary Exploration with Swarms of Low-Cost Spacecraft
Figure 4 for BLISS: Interplanetary Exploration with Swarms of Low-Cost Spacecraft
Viaarxiv icon

Measuring Data

Add code
Dec 09, 2022
Figure 1 for Measuring Data
Figure 2 for Measuring Data
Figure 3 for Measuring Data
Viaarxiv icon