Picture for Suvinay Subramanian

Suvinay Subramanian

Effective Interplay between Sparsity and Quantization: From Theory to Practice

Add code
May 31, 2024
Viaarxiv icon

Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

Add code
Feb 07, 2024
Figure 1 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Figure 2 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Figure 3 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Figure 4 for Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers
Viaarxiv icon

JaxPruner: A concise library for sparsity research

Add code
May 02, 2023
Figure 1 for JaxPruner: A concise library for sparsity research
Figure 2 for JaxPruner: A concise library for sparsity research
Viaarxiv icon

TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings

Add code
Apr 20, 2023
Figure 1 for TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Figure 2 for TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Figure 3 for TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Figure 4 for TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings
Viaarxiv icon

STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition

Add code
Feb 02, 2023
Figure 1 for STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition
Figure 2 for STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition
Figure 3 for STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition
Figure 4 for STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition
Viaarxiv icon

Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask

Add code
Sep 15, 2022
Figure 1 for Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Figure 2 for Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Figure 3 for Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Figure 4 for Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask
Viaarxiv icon

ATTACC the Quadratic Bottleneck of Attention Layers

Add code
Jul 13, 2021
Figure 1 for ATTACC the Quadratic Bottleneck of Attention Layers
Figure 2 for ATTACC the Quadratic Bottleneck of Attention Layers
Figure 3 for ATTACC the Quadratic Bottleneck of Attention Layers
Figure 4 for ATTACC the Quadratic Bottleneck of Attention Layers
Viaarxiv icon