Jane Xu

GaLore 2: Large-Scale LLM Pre-Training by Gradient Low-Rank Projection

Apr 29, 2025
Via arXiv