Picture for Hanwen Chang

Hanwen Chang

Efficient LLM Inference on CPUs

Add code
Nov 01, 2023
Figure 1 for Efficient LLM Inference on CPUs
Figure 2 for Efficient LLM Inference on CPUs
Figure 3 for Efficient LLM Inference on CPUs
Figure 4 for Efficient LLM Inference on CPUs
Viaarxiv icon

An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs

Add code
Jun 28, 2023
Figure 1 for An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs
Figure 2 for An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs
Figure 3 for An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs
Figure 4 for An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs
Viaarxiv icon

Fast DistilBERT on CPUs

Add code
Oct 27, 2022
Figure 1 for Fast DistilBERT on CPUs
Figure 2 for Fast DistilBERT on CPUs
Figure 3 for Fast DistilBERT on CPUs
Figure 4 for Fast DistilBERT on CPUs
Viaarxiv icon