Peng Pei

GradPower: Powering Gradients for Faster Language Model Pre-Training

May 30, 2025
Via arXiv