Alert button
Picture for Chenhao Xue

Chenhao Xue

Alert button

LLM Inference Unveiled: Survey and Roofline Model Insights

Add code
Bookmark button
Alert button
Mar 11, 2024
Zhihang Yuan, Yuzhang Shang, Yang Zhou, Zhen Dong, Zhe Zhou, Chenhao Xue, Bingzhe Wu, Zhikai Li, Qingyi Gu, Yong Jae Lee, Yan Yan, Beidi Chen, Guangyu Sun, Kurt Keutzer

Viaarxiv icon

Latency-aware Spatial-wise Dynamic Networks

Add code
Bookmark button
Alert button
Oct 12, 2022
Yizeng Han, Zhihang Yuan, Yifan Pu, Chenhao Xue, Shiji Song, Guangyu Sun, Gao Huang

Figure 1 for Latency-aware Spatial-wise Dynamic Networks
Figure 2 for Latency-aware Spatial-wise Dynamic Networks
Figure 3 for Latency-aware Spatial-wise Dynamic Networks
Figure 4 for Latency-aware Spatial-wise Dynamic Networks
Viaarxiv icon

PTQ4ViT: Post-Training Quantization Framework for Vision Transformers

Add code
Bookmark button
Alert button
Nov 24, 2021
Zhihang Yuan, Chenhao Xue, Yiqi Chen, Qiang Wu, Guangyu Sun

Figure 1 for PTQ4ViT: Post-Training Quantization Framework for Vision Transformers
Figure 2 for PTQ4ViT: Post-Training Quantization Framework for Vision Transformers
Figure 3 for PTQ4ViT: Post-Training Quantization Framework for Vision Transformers
Figure 4 for PTQ4ViT: Post-Training Quantization Framework for Vision Transformers
Viaarxiv icon

PTQ-SL: Exploring the Sub-layerwise Post-training Quantization

Add code
Bookmark button
Alert button
Oct 18, 2021
Zhihang Yuan, Yiqi Chen, Chenhao Xue, Chenguang Zhang, Qiankun Wang, Guangyu Sun

Figure 1 for PTQ-SL: Exploring the Sub-layerwise Post-training Quantization
Figure 2 for PTQ-SL: Exploring the Sub-layerwise Post-training Quantization
Figure 3 for PTQ-SL: Exploring the Sub-layerwise Post-training Quantization
Figure 4 for PTQ-SL: Exploring the Sub-layerwise Post-training Quantization
Viaarxiv icon