Alert button
Picture for Martin Jaggi

Martin Jaggi

Alert button

Ghost Noise for Regularizing Deep Neural Networks

Add code
Bookmark button
Alert button
May 26, 2023
Atli Kosson, Dongyang Fan, Martin Jaggi

Figure 1 for Ghost Noise for Regularizing Deep Neural Networks
Figure 2 for Ghost Noise for Regularizing Deep Neural Networks
Figure 3 for Ghost Noise for Regularizing Deep Neural Networks
Figure 4 for Ghost Noise for Regularizing Deep Neural Networks
Viaarxiv icon

Hardware-Efficient Transformer Training via Piecewise Affine Operations

Add code
Bookmark button
Alert button
May 26, 2023
Atli Kosson, Martin Jaggi

Figure 1 for Hardware-Efficient Transformer Training via Piecewise Affine Operations
Figure 2 for Hardware-Efficient Transformer Training via Piecewise Affine Operations
Figure 3 for Hardware-Efficient Transformer Training via Piecewise Affine Operations
Figure 4 for Hardware-Efficient Transformer Training via Piecewise Affine Operations
Viaarxiv icon

Landmark Attention: Random-Access Infinite Context Length for Transformers

Add code
Bookmark button
Alert button
May 25, 2023
Amirkeivan Mohtashami, Martin Jaggi

Figure 1 for Landmark Attention: Random-Access Infinite Context Length for Transformers
Figure 2 for Landmark Attention: Random-Access Infinite Context Length for Transformers
Figure 3 for Landmark Attention: Random-Access Infinite Context Length for Transformers
Figure 4 for Landmark Attention: Random-Access Infinite Context Length for Transformers
Viaarxiv icon

Linearization Algorithms for Fully Composite Optimization

Add code
Bookmark button
Alert button
Feb 24, 2023
Maria-Luiza Vladarean, Nikita Doikov, Martin Jaggi, Nicolas Flammarion

Figure 1 for Linearization Algorithms for Fully Composite Optimization
Figure 2 for Linearization Algorithms for Fully Composite Optimization
Viaarxiv icon

Unified Convergence Theory of Stochastic and Variance-Reduced Cubic Newton Methods

Add code
Bookmark button
Alert button
Feb 23, 2023
El Mahdi Chayti, Nikita Doikov, Martin Jaggi

Viaarxiv icon

Beyond spectral gap (extended): The role of the topology in decentralized learning

Add code
Bookmark button
Alert button
Jan 05, 2023
Thijs Vogels, Hadrien Hendrikx, Martin Jaggi

Figure 1 for Beyond spectral gap (extended): The role of the topology in decentralized learning
Figure 2 for Beyond spectral gap (extended): The role of the topology in decentralized learning
Figure 3 for Beyond spectral gap (extended): The role of the topology in decentralized learning
Figure 4 for Beyond spectral gap (extended): The role of the topology in decentralized learning
Viaarxiv icon

Scalable Collaborative Learning via Representation Sharing

Add code
Bookmark button
Alert button
Dec 13, 2022
Frédéric Berdoz, Abhishek Singh, Martin Jaggi, Ramesh Raskar

Figure 1 for Scalable Collaborative Learning via Representation Sharing
Figure 2 for Scalable Collaborative Learning via Representation Sharing
Figure 3 for Scalable Collaborative Learning via Representation Sharing
Figure 4 for Scalable Collaborative Learning via Representation Sharing
Viaarxiv icon

Second-order optimization with lazy Hessians

Add code
Bookmark button
Alert button
Dec 13, 2022
Nikita Doikov, El Mahdi Chayti, Martin Jaggi

Figure 1 for Second-order optimization with lazy Hessians
Viaarxiv icon

Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating-Point for DNN Training

Add code
Bookmark button
Alert button
Nov 22, 2022
Simla Burcu Harma, Canberk Sönmez, Babak Falsafi, Martin Jaggi, Yunho Oh

Figure 1 for Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating-Point for DNN Training
Figure 2 for Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating-Point for DNN Training
Figure 3 for Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating-Point for DNN Training
Figure 4 for Accuracy Boosters: Epoch-Driven Mixed-Mantissa Block Floating-Point for DNN Training
Viaarxiv icon