Aaron Gokaslan

The Diffusion Duality

Jun 12, 2025
Via arXiv

OpenThoughts: Data Recipes for Reasoning Models

Jun 05, 2025
Via arXiv

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Jun 05, 2025
Via arXiv

Extracting memorized pieces of (copyrighted) books from open-weight language models

May 18, 2025
Via arXiv

RanDeS: Randomized Delta Superposition for Multi-Model Compression

May 16, 2025
Via arXiv

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Mar 12, 2025
Via arXiv

The GAN is dead; long live the GAN! A Modern GAN Baseline

Jan 09, 2025
Via arXiv

Self-Directed Synthetic Dialogues and Revisions Technical Report

Jul 25, 2024
Via arXiv

DataComp-LM: In search of the next generation of training sets for language models

Jun 18, 2024
Via arXiv

Vid3D: Synthesis of Dynamic 3D Scenes using 2D Video Diffusion

Jun 17, 2024
Via arXiv