Ludwig Schmidt

Shammie

ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs

Mar 19, 2026
Via arXiv

Beyond a Single Extractor: Re-thinking HTML-to-Text Extraction for LLM Pretraining

Feb 23, 2026
Via arXiv

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Jan 17, 2026
Via arXiv

NitroGen: An Open Foundation Model for Generalist Gaming Agents

Jan 04, 2026
Via arXiv

Reusing Pre-Training Data at Test Time is a Compute Multiplier

Nov 06, 2025
Via arXiv

OLMoASR: Open Models and Data for Training Robust Speech Recognition Models

Aug 28, 2025
Via arXiv

OpenThoughts: Data Recipes for Reasoning Models

Jun 05, 2025
Via arXiv

Recycling the Web: A Method to Enhance Pre-training Data Quality and Quantity for Language Models

Jun 05, 2025
Via arXiv

SWE-smith: Scaling Data for Software Engineering Agents

Apr 30, 2025
Via arXiv

Datasets, Documents, and Repetitions: The Practicalities of Unequal Data Quality

Mar 10, 2025
Via arXiv