Picture for Jerome Wang

Jerome Wang

1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data

Add code
Aug 07, 2024
Viaarxiv icon