Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Jonathan Hseu

Reducing BERT Pre-Training Time from 3 Days to 76 Minutes

Apr 01, 2019
Yang You, Jing Li, Jonathan Hseu, Xiaodan Song, James Demmel, Cho-Jui Hsieh

  Access Paper or Ask Questions

Large-Batch Training for LSTM and Beyond

Jan 24, 2019
Yang You, Jonathan Hseu, Chris Ying, James Demmel, Kurt Keutzer, Cho-Jui Hsieh

* Preprint. Work in progress. We may update this draft recently 

  Access Paper or Ask Questions