Alert button
Picture for Xipeng Zhang

Xipeng Zhang

Alert button

E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity

Add code
Bookmark button
Alert button
Oct 24, 2023
Yun Li, Lin Niu, Xipeng Zhang, Kai Liu, Jianchen Zhu, Zhanhui Kang

Viaarxiv icon

MKQ-BERT: Quantized BERT with 4-bits Weights and Activations

Add code
Bookmark button
Alert button
Mar 25, 2022
Hanlin Tang, Xipeng Zhang, Kai Liu, Jianchen Zhu, Zhanhui Kang

Figure 1 for MKQ-BERT: Quantized BERT with 4-bits Weights and Activations
Figure 2 for MKQ-BERT: Quantized BERT with 4-bits Weights and Activations
Figure 3 for MKQ-BERT: Quantized BERT with 4-bits Weights and Activations
Viaarxiv icon