Distributed SLIDE: Enabling Training Large Neural Networks on Low Bandwidth and Simple CPU-Clusters via Model Parallelism and Sparsity

Jan 29, 2022
Minghao Yan, Nicholas Meisburger, Tharun Medini, Anshumali Shrivastava
