Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Understanding the Role of Momentum in Stochastic Gradient Methods


Oct 30, 2019
Igor Gitman, Hunter Lang, Pengchuan Zhang, Lin Xiao

Add code

* 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

OpenSeq2Seq: extensible toolkit for distributed and mixed precision training of sequence-to-sequence models


May 25, 2018
Oleksii Kuchaiev, Boris Ginsburg, Igor Gitman, Vitaly Lavrukhin, Carl Case, Paulius Micikevicius

Add code

* to be presented at Workshop for Natural Language Processing Open Source Software (NLP-OSS), co-located with ACL2018 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Novel Prediction Techniques Based on Clusterwise Linear Regression


Apr 28, 2018
Igor Gitman, Jieshi Chen, Eric Lei, Artur Dubrawski

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Convergence Analysis of Gradient Descent Algorithms with Proportional Updates


Jan 09, 2018
Igor Gitman, Deepak Dilipkumar, Ben Parr

Add code

* Source code (uses TensorFlow): https://github.com/bparr/lars 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Comparison of Batch Normalization and Weight Normalization Algorithms for the Large-scale Image Classification


Oct 07, 2017
Igor Gitman, Boris Ginsburg

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Large Batch Training of Convolutional Networks


Sep 13, 2017
Yang You, Igor Gitman, Boris Ginsburg

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email