Suvinay Subramanian

Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

Feb 07, 2024
Abhimanyu Rajeshkumar Bambhaniya, Amir Yazdanbakhsh, Suvinay Subramanian, Sheng-Chun Kao, Shivani Agrawal, Utku Evci, Tushar Krishna

Via arXiv

JaxPruner: A concise library for sparsity research

May 02, 2023
Joo Hyung Lee, Wonpyo Park, Nicole Mitchell, Jonathan Pilault, Johan Obando-Ceron, Han-Byul Kim, Namhoon Lee, Elias Frantar, Yun Long, Amir Yazdanbakhsh, Shivani Agrawal, Suvinay Subramanian, Xin Wang, Sheng-Chun Kao, Xingyao Zhang, Trevor Gale, Aart Bik, Woohyun Han, Milen Ferev, Zhonglin Han, Hong-Seok Kim, Yann Dauphin, Gintare Karolina Dziugaite, Pablo Samuel Castro, Utku Evci

Via arXiv

TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embeddings

Apr 20, 2023
Norman P. Jouppi, George Kurian, Sheng Li, Peter Ma, Rahul Nagarajan, Lifeng Nai, Nishant Patil, Suvinay Subramanian, Andy Swing, Brian Towles, Cliff Young, Xiang Zhou, Zongwei Zhou, David Patterson

Via arXiv

STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition

Feb 02, 2023
Yucheng Lu, Shivani Agrawal, Suvinay Subramanian, Oleg Rybakov, Christopher De Sa, Amir Yazdanbakhsh

Via arXiv

Training Recipe for N:M Structured Sparsity with Decaying Pruning Mask

Sep 15, 2022
Sheng-Chun Kao, Amir Yazdanbakhsh, Suvinay Subramanian, Shivani Agrawal, Utku Evci, Tushar Krishna

Via arXiv

ATTACC the Quadratic Bottleneck of Attention Layers

Jul 13, 2021
Sheng-Chun Kao, Suvinay Subramanian, Gaurav Agrawal, Tushar Krishna

Via arXiv