Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training

Add code
May 19, 2025
Figure 1 for Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training
Figure 2 for Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training
Figure 3 for Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training
Figure 4 for Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: