Picture for Carla Varela-Rosa

Carla Varela-Rosa

Scaling Performance of Large Language Model Pretraining

Add code
Sep 05, 2025
Viaarxiv icon