Atli Kosson

Memory Efficient Mixed-Precision Optimizers

Sep 21, 2023
Basile Lewandowski, Atli Kosson

Rotational Optimizers: Simple & Robust DNN Training

May 26, 2023
Atli Kosson, Bettina Messmer, Martin Jaggi

Ghost Noise for Regularizing Deep Neural Networks

May 26, 2023
Atli Kosson, Dongyang Fan, Martin Jaggi

Hardware-Efficient Transformer Training via Piecewise Affine Operations

May 26, 2023
Atli Kosson, Martin Jaggi

Adaptive Braking for Mitigating Gradient Delay

Jul 10, 2020
Abhinav Venigalla, Atli Kosson, Vitaliy Chiley, Urs Köster

Pipelined Backpropagation at Scale: Training Large Models without Batches

Mar 25, 2020
Atli Kosson, Vitaliy Chiley, Abhinav Venigalla, Joel Hestness, Urs Köster

Online Normalization for Training Neural Networks

May 28, 2019
Vitaliy Chiley, Ilya Sharapov, Atli Kosson, Urs Köster, Ryan Reece, Sofia Samaniego de la Fuente, Vishal Subbiah, Michael James
