Alert button
Picture for Yaohui Cai

Yaohui Cai

Alert button

Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs

Add code
Bookmark button
Alert button
Jan 31, 2024
Dingyi Dai, Yichi Zhang, Jiahao Zhang, Zhanqiu Hu, Yaohui Cai, Qi Sun, Zhiru Zhang

Viaarxiv icon

Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference

Add code
Bookmark button
Alert button
Dec 23, 2023
Hongzheng Chen, Jiahao Zhang, Yixiao Du, Shaojie Xiang, Zichao Yue, Niansong Zhang, Yaohui Cai, Zhiru Zhang

Viaarxiv icon

QuIP: 2-Bit Quantization of Large Language Models With Guarantees

Add code
Bookmark button
Alert button
Jul 25, 2023
Jerry Chee, Yaohui Cai, Volodymyr Kuleshov, Christopher De Sa

Figure 1 for QuIP: 2-Bit Quantization of Large Language Models With Guarantees
Figure 2 for QuIP: 2-Bit Quantization of Large Language Models With Guarantees
Figure 3 for QuIP: 2-Bit Quantization of Large Language Models With Guarantees
Figure 4 for QuIP: 2-Bit Quantization of Large Language Models With Guarantees
Viaarxiv icon

Structured Pruning is All You Need for Pruning CNNs at Initialization

Add code
Bookmark button
Alert button
Mar 04, 2022
Yaohui Cai, Weizhe Hua, Hongzheng Chen, G. Edward Suh, Christopher De Sa, Zhiru Zhang

Figure 1 for Structured Pruning is All You Need for Pruning CNNs at Initialization
Figure 2 for Structured Pruning is All You Need for Pruning CNNs at Initialization
Figure 3 for Structured Pruning is All You Need for Pruning CNNs at Initialization
Figure 4 for Structured Pruning is All You Need for Pruning CNNs at Initialization
Viaarxiv icon

SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation

Add code
Bookmark button
Alert button
Feb 07, 2021
Wuxinlin Cheng, Chenhui Deng, Zhiqiang Zhao, Yaohui Cai, Zhiru Zhang, Zhuo Feng

Figure 1 for SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation
Figure 2 for SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation
Figure 3 for SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation
Figure 4 for SPADE: A Spectral Method for Black-Box Adversarial Robustness Evaluation
Viaarxiv icon

CoDeNet: Algorithm-hardware Co-design for Deformable Convolution

Add code
Bookmark button
Alert button
Jun 12, 2020
Zhen Dong, Dequan Wang, Qijing Huang, Yizhao Gao, Yaohui Cai, Bichen Wu, Kurt Keutzer, John Wawrzynek

Figure 1 for CoDeNet: Algorithm-hardware Co-design for Deformable Convolution
Figure 2 for CoDeNet: Algorithm-hardware Co-design for Deformable Convolution
Figure 3 for CoDeNet: Algorithm-hardware Co-design for Deformable Convolution
Figure 4 for CoDeNet: Algorithm-hardware Co-design for Deformable Convolution
Viaarxiv icon

Algorithm-hardware Co-design for Deformable Convolution

Add code
Bookmark button
Alert button
Feb 19, 2020
Qijing Huang, Dequan Wang, Yizhao Gao, Yaohui Cai, Zhen Dong, Bichen Wu, Kurt Keutzer, John Wawrzynek

Figure 1 for Algorithm-hardware Co-design for Deformable Convolution
Figure 2 for Algorithm-hardware Co-design for Deformable Convolution
Viaarxiv icon

ZeroQ: A Novel Zero Shot Quantization Framework

Add code
Bookmark button
Alert button
Jan 01, 2020
Yaohui Cai, Zhewei Yao, Zhen Dong, Amir Gholami, Michael W. Mahoney, Kurt Keutzer

Figure 1 for ZeroQ: A Novel Zero Shot Quantization Framework
Figure 2 for ZeroQ: A Novel Zero Shot Quantization Framework
Figure 3 for ZeroQ: A Novel Zero Shot Quantization Framework
Figure 4 for ZeroQ: A Novel Zero Shot Quantization Framework
Viaarxiv icon

HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks

Add code
Bookmark button
Alert button
Nov 10, 2019
Zhen Dong, Zhewei Yao, Yaohui Cai, Daiyaan Arfeen, Amir Gholami, Michael W. Mahoney, Kurt Keutzer

Figure 1 for HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Figure 2 for HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Figure 3 for HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Figure 4 for HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks
Viaarxiv icon