Igor Gitman

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Feb 15, 2024
Shubham Toshniwal, Ivan Moshkov, Sean Narenthiran, Daria Gitman, Fei Jia, Igor Gitman

Confidence-based Ensembles of End-to-End Speech Recognition Models

Jun 27, 2023
Igor Gitman, Vitaly Lavrukhin, Aleksandr Laptev, Boris Ginsburg

Powerful and Extensible WFST Framework for RNN-Transducer Losses

Mar 18, 2023
Aleksandr Laptev, Vladimir Bataev, Igor Gitman, Boris Ginsburg

Understanding the Role of Momentum in Stochastic Gradient Methods

Oct 30, 2019
Igor Gitman, Hunter Lang, Pengchuan Zhang, Lin Xiao

OpenSeq2Seq: extensible toolkit for distributed and mixed precision training of sequence-to-sequence models

May 25, 2018
Oleksii Kuchaiev, Boris Ginsburg, Igor Gitman, Vitaly Lavrukhin, Carl Case, Paulius Micikevicius

Novel Prediction Techniques Based on Clusterwise Linear Regression

Apr 28, 2018
Igor Gitman, Jieshi Chen, Eric Lei, Artur Dubrawski

Convergence Analysis of Gradient Descent Algorithms with Proportional Updates

Jan 09, 2018
Igor Gitman, Deepak Dilipkumar, Ben Parr

Comparison of Batch Normalization and Weight Normalization Algorithms for the Large-scale Image Classification

Oct 07, 2017
Igor Gitman, Boris Ginsburg

Large Batch Training of Convolutional Networks

Sep 13, 2017
Yang You, Igor Gitman, Boris Ginsburg
