Alert button
Picture for Animesh Jain

Animesh Jain

Alert button

Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions

Add code
Bookmark button
Alert button
Mar 14, 2023
Kaiqi Zhao, Animesh Jain, Ming Zhao

Figure 1 for Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions
Figure 2 for Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions
Figure 3 for Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions
Figure 4 for Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions
Viaarxiv icon

Iterative Activation-based Structured Pruning

Add code
Bookmark button
Alert button
Jan 22, 2022
Kaiqi Zhao, Animesh Jain, Ming Zhao

Figure 1 for Iterative Activation-based Structured Pruning
Figure 2 for Iterative Activation-based Structured Pruning
Figure 3 for Iterative Activation-based Structured Pruning
Figure 4 for Iterative Activation-based Structured Pruning
Viaarxiv icon

Adaptive Activation-based Structured Pruning

Add code
Bookmark button
Alert button
Jan 21, 2022
Kaiqi Zhao, Animesh Jain, Ming Zhao

Figure 1 for Adaptive Activation-based Structured Pruning
Figure 2 for Adaptive Activation-based Structured Pruning
Figure 3 for Adaptive Activation-based Structured Pruning
Figure 4 for Adaptive Activation-based Structured Pruning
Viaarxiv icon

Automated Backend-Aware Post-Training Quantization

Add code
Bookmark button
Alert button
Mar 27, 2021
Ziheng Jiang, Animesh Jain, Andrew Liu, Josh Fromm, Chengqian Ma, Tianqi Chen, Luis Ceze

Figure 1 for Automated Backend-Aware Post-Training Quantization
Figure 2 for Automated Backend-Aware Post-Training Quantization
Figure 3 for Automated Backend-Aware Post-Training Quantization
Figure 4 for Automated Backend-Aware Post-Training Quantization
Viaarxiv icon

UNIT: Unifying Tensorized Instruction Compilation

Add code
Bookmark button
Alert button
Jan 21, 2021
Jian Weng, Animesh Jain, Jie Wang, Leyuan Wang, Yida Wang, Tony Nowatzki

Figure 1 for UNIT: Unifying Tensorized Instruction Compilation
Figure 2 for UNIT: Unifying Tensorized Instruction Compilation
Figure 3 for UNIT: Unifying Tensorized Instruction Compilation
Figure 4 for UNIT: Unifying Tensorized Instruction Compilation
Viaarxiv icon

Efficient Execution of Quantized Deep Learning Models: A Compiler Approach

Add code
Bookmark button
Alert button
Jun 18, 2020
Animesh Jain, Shoubhik Bhattacharya, Masahiro Masuda, Vin Sharma, Yida Wang

Figure 1 for Efficient Execution of Quantized Deep Learning Models: A Compiler Approach
Figure 2 for Efficient Execution of Quantized Deep Learning Models: A Compiler Approach
Figure 3 for Efficient Execution of Quantized Deep Learning Models: A Compiler Approach
Figure 4 for Efficient Execution of Quantized Deep Learning Models: A Compiler Approach
Viaarxiv icon

Optimizing Memory-Access Patterns for Deep Learning Accelerators

Add code
Bookmark button
Alert button
Feb 27, 2020
Hongbin Zheng, Sejong Oh, Huiqing Wang, Preston Briggs, Jiading Gai, Animesh Jain, Yizhi Liu, Rich Heaton, Randy Huang, Yida Wang

Viaarxiv icon