Alert button
Picture for Abhimanyu Rajeshkumar Bambhaniya

Abhimanyu Rajeshkumar Bambhaniya

Alert button

Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

Add code
Bookmark button
Alert button
Feb 07, 2024
Abhimanyu Rajeshkumar Bambhaniya, Amir Yazdanbakhsh, Suvinay Subramanian, Sheng-Chun Kao, Shivani Agrawal, Utku Evci, Tushar Krishna

Viaarxiv icon

Subgraph Stationary Hardware-Software Inference Co-Design

Add code
Bookmark button
Alert button
Jun 21, 2023
Payman Behnam, Jianming Tong, Alind Khare, Yangyu Chen, Yue Pan, Pranav Gadikar, Abhimanyu Rajeshkumar Bambhaniya, Tushar Krishna, Alexey Tumanov

Figure 1 for Subgraph Stationary Hardware-Software Inference Co-Design
Figure 2 for Subgraph Stationary Hardware-Software Inference Co-Design
Figure 3 for Subgraph Stationary Hardware-Software Inference Co-Design
Figure 4 for Subgraph Stationary Hardware-Software Inference Co-Design
Viaarxiv icon

VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs

Add code
Bookmark button
Alert button
Feb 23, 2023
Geonhwa Jeong, Sana Damani, Abhimanyu Rajeshkumar Bambhaniya, Eric Qin, Christopher J. Hughes, Sreenivas Subramoney, Hyesoon Kim, Tushar Krishna

Figure 1 for VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Figure 2 for VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Figure 3 for VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Figure 4 for VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Viaarxiv icon

COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training

Add code
Bookmark button
Alert button
Nov 30, 2022
Divya Kiran Kadiyala, Saeed Rashidi, Taekyung Heo, Abhimanyu Rajeshkumar Bambhaniya, Tushar Krishna, Alexandros Daglis

Figure 1 for COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training
Figure 2 for COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training
Figure 3 for COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training
Figure 4 for COMET: A Comprehensive Cluster Design Methodology for Distributed Deep Learning Training
Viaarxiv icon