Picture for Ludwig Schmidt

Ludwig Schmidt

Shammie

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Add code
Jan 17, 2026
Viaarxiv icon

NitroGen: An Open Foundation Model for Generalist Gaming Agents

Add code
Jan 04, 2026
Viaarxiv icon

Reusing Pre-Training Data at Test Time is a Compute Multiplier

Add code
Nov 06, 2025
Figure 1 for Reusing Pre-Training Data at Test Time is a Compute Multiplier
Figure 2 for Reusing Pre-Training Data at Test Time is a Compute Multiplier
Figure 3 for Reusing Pre-Training Data at Test Time is a Compute Multiplier
Figure 4 for Reusing Pre-Training Data at Test Time is a Compute Multiplier
Viaarxiv icon

OLMoASR: Open Models and Data for Training Robust Speech Recognition Models

Add code
Aug 28, 2025
Viaarxiv icon

Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models

Add code
Jun 05, 2025
Viaarxiv icon

OpenThoughts: Data Recipes for Reasoning Models

Add code
Jun 05, 2025
Viaarxiv icon

SWE-smith: Scaling Data for Software Engineering Agents

Add code
Apr 30, 2025
Viaarxiv icon

Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality

Add code
Mar 10, 2025
Viaarxiv icon

Should VLMs be Pre-trained with Image Data?

Add code
Mar 10, 2025
Viaarxiv icon

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Add code
Feb 26, 2025
Figure 1 for Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs
Figure 2 for Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs
Figure 3 for Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs
Figure 4 for Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs
Viaarxiv icon