Alert button

The Fine-Grained Complexity of Gradient Computation for Training Large Language Models

Feb 07, 2024
Josh Alman, Zhao Song

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: