Picture for Alex Kogan

Alex Kogan

Is (Selective) Round-To-Nearest Quantization All You Need?

Add code
May 21, 2025
Viaarxiv icon

Improving Inference Performance of Machine Learning with the Divide-and-Conquer Principle

Add code
Jan 12, 2023
Viaarxiv icon

Optimizing Inference Performance of Transformers on CPUs

Add code
Feb 22, 2021
Figure 1 for Optimizing Inference Performance of Transformers on CPUs
Figure 2 for Optimizing Inference Performance of Transformers on CPUs
Figure 3 for Optimizing Inference Performance of Transformers on CPUs
Figure 4 for Optimizing Inference Performance of Transformers on CPUs
Viaarxiv icon