DiLoCo: Distributed Low-Communication Training of Language Models

Nov 14, 2023
Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Rachita Chhaparia, Yani Donchev, Adhiguna Kuncoro, Marc'Aurelio Ranzato, Arthur Szlam, Jiajun Shen
