Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:PipeMare: Asynchronous Pipeline Parallel DNN Training

Oct 09, 2019

Bowen Yang, Jian Zhang, Jonathan Li, Christopher Ré, Christopher R. Aberger, Christopher De Sa

Figure 1 for PipeMare: Asynchronous Pipeline Parallel DNN Training

Figure 2 for PipeMare: Asynchronous Pipeline Parallel DNN Training

Figure 3 for PipeMare: Asynchronous Pipeline Parallel DNN Training

Figure 4 for PipeMare: Asynchronous Pipeline Parallel DNN Training

Share this with someone who'll enjoy it:

Abstract:Recently there has been a flurry of interest around using pipeline parallelism while training neural networks. Pipeline parallelism enables larger models to be partitioned spatially across chips and within a chip, leading to both lower network communication and overall higher hardware utilization. Unfortunately, to preserve statistical efficiency, existing pipeline-parallelism techniques sacrifice hardware efficiency by introducing bubbles into the pipeline and/or incurring extra memory costs. In this paper, we investigate to what extent these sacrifices are necessary. Theoretically, we derive a simple but robust training method, called PipeMare, that tolerates asynchronous updates during pipeline-parallel execution. Using this, we show empirically, on a ResNet network and a Transformer network, that PipeMare can achieve final model qualities that match those of synchronous training techniques (at most 0.9% worse test accuracy and 0.3 better test BLEU score) while either using up to 2.0X less weight and optimizer memory or being up to 3.3X faster than other pipeline parallel training techniques. To the best of our knowledge we are the first to explore these techniques and fine-grained pipeline parallelism (e.g. the number of pipeline stages equals to the number of layers) during neural network training.

View paper on

Share this with someone who'll enjoy it:

Title:PipeMare: Asynchronous Pipeline Parallel DNN Training

Paper and Code