Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Igor Gitman

Understanding the Role of Momentum in Stochastic Gradient Methods

Oct 30, 2019
Igor Gitman, Hunter Lang, Pengchuan Zhang, Lin Xiao

* 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada 

  Access Paper or Ask Questions

OpenSeq2Seq: extensible toolkit for distributed and mixed precision training of sequence-to-sequence models

May 25, 2018
Oleksii Kuchaiev, Boris Ginsburg, Igor Gitman, Vitaly Lavrukhin, Carl Case, Paulius Micikevicius

* to be presented at Workshop for Natural Language Processing Open Source Software (NLP-OSS), co-located with ACL2018 

  Access Paper or Ask Questions

Novel Prediction Techniques Based on Clusterwise Linear Regression

Apr 28, 2018
Igor Gitman, Jieshi Chen, Eric Lei, Artur Dubrawski

  Access Paper or Ask Questions

Convergence Analysis of Gradient Descent Algorithms with Proportional Updates

Jan 09, 2018
Igor Gitman, Deepak Dilipkumar, Ben Parr

* Source code (uses TensorFlow): 

  Access Paper or Ask Questions

Comparison of Batch Normalization and Weight Normalization Algorithms for the Large-scale Image Classification

Oct 07, 2017
Igor Gitman, Boris Ginsburg

  Access Paper or Ask Questions

Large Batch Training of Convolutional Networks

Sep 13, 2017
Yang You, Igor Gitman, Boris Ginsburg

  Access Paper or Ask Questions