Zhewei Yao

What's Hidden in a One-layer Randomly Weighted Transformer?

Sep 08, 2021
Sheng Shen, Zhewei Yao, Douwe Kiela, Kurt Keutzer, Michael W. Mahoney

How Much Can CLIP Benefit Vision-and-Language Tasks?

Jul 13, 2021
Sheng Shen, Liunian Harold Li, Hao Tan, Mohit Bansal, Anna Rohrbach, Kai-Wei Chang, Zhewei Yao, Kurt Keutzer

MLPruning: A Multilevel Structured Pruning Framework for Transformer-based Models

May 30, 2021
Zhewei Yao, Linjian Ma, Sheng Shen, Kurt Keutzer, Michael W. Mahoney

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Apr 29, 2021
Jianfei Chen, Lianmin Zheng, Zhewei Yao, Dequan Wang, Ion Stoica, Michael W. Mahoney, Joseph E. Gonzalez

Q-ASR: Integer-only Zero-shot Quantization for Efficient Speech Recognition

Mar 31, 2021
Sehoon Kim, Amir Gholami, Zhewei Yao, Anirudda Nrusimha, Bohan Zhai, Tianren Gao, Michael W. Mahoney, Kurt Keutzer

A Survey of Quantization Methods for Efficient Neural Network Inference

Mar 25, 2021
Amir Gholami, Sehoon Kim, Zhen Dong, Zhewei Yao, Michael W. Mahoney, Kurt Keutzer

I-BERT: Integer-only BERT Quantization

Feb 11, 2021
Sehoon Kim, Amir Gholami, Zhewei Yao, Michael W. Mahoney, Kurt Keutzer

Hessian-Aware Pruning and Optimal Neural Implant

Feb 06, 2021
Shixing Yu, Zhewei Yao, Amir Gholami, Zhen Dong, Michael W. Mahoney, Kurt Keutzer
