Alert button
Picture for Saleh Ashkboos

Saleh Ashkboos

Alert button

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

Add code
Bookmark button
Alert button
Mar 30, 2024
Saleh Ashkboos, Amirkeivan Mohtashami, Maximilian L. Croci, Bo Li, Martin Jaggi, Dan Alistarh, Torsten Hoefler, James Hensman

Viaarxiv icon

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Add code
Bookmark button
Alert button
Jan 26, 2024
Saleh Ashkboos, Maximilian L. Croci, Marcelo Gennari do Nascimento, Torsten Hoefler, James Hensman

Viaarxiv icon

Towards End-to-end 4-Bit Inference on Generative Large Language Models

Add code
Bookmark button
Alert button
Oct 13, 2023
Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh

Viaarxiv icon

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression

Add code
Bookmark button
Alert button
Jun 05, 2023
Tim Dettmers, Ruslan Svirschevski, Vage Egiazarian, Denis Kuznedelev, Elias Frantar, Saleh Ashkboos, Alexander Borzunov, Torsten Hoefler, Dan Alistarh

Figure 1 for SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
Figure 2 for SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
Figure 3 for SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
Figure 4 for SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
Viaarxiv icon

STen: Productive and Efficient Sparsity in PyTorch

Add code
Bookmark button
Alert button
Apr 15, 2023
Andrei Ivanov, Nikoli Dryden, Tal Ben-Nun, Saleh Ashkboos, Torsten Hoefler

Figure 1 for STen: Productive and Efficient Sparsity in PyTorch
Figure 2 for STen: Productive and Efficient Sparsity in PyTorch
Figure 3 for STen: Productive and Efficient Sparsity in PyTorch
Figure 4 for STen: Productive and Efficient Sparsity in PyTorch
Viaarxiv icon

GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers

Add code
Bookmark button
Alert button
Oct 31, 2022
Elias Frantar, Saleh Ashkboos, Torsten Hoefler, Dan Alistarh

Figure 1 for GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Figure 2 for GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Figure 3 for GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Figure 4 for GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Viaarxiv icon

ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast

Add code
Bookmark button
Alert button
Jun 29, 2022
Saleh Ashkboos, Langwen Huang, Nikoli Dryden, Tal Ben-Nun, Peter Dueben, Lukas Gianinazzi, Luca Kummer, Torsten Hoefler

Figure 1 for ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast
Figure 2 for ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast
Figure 3 for ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast
Figure 4 for ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast
Viaarxiv icon

Motif Prediction with Graph Neural Networks

Add code
Bookmark button
Alert button
Jun 05, 2021
Maciej Besta, Raphael Grob, Cesare Miglioli, Nicola Bernold, Grzegorz Kwasniewski, Gabriel Gjini, Raghavendra Kanakagiri, Saleh Ashkboos, Lukas Gianinazzi, Nikoli Dryden, Torsten Hoefler

Figure 1 for Motif Prediction with Graph Neural Networks
Figure 2 for Motif Prediction with Graph Neural Networks
Figure 3 for Motif Prediction with Graph Neural Networks
Viaarxiv icon

Distributed Mean Estimation with Optimal Error Bounds

Add code
Bookmark button
Alert button
Feb 24, 2020
Dan Alistarh, Saleh Ashkboos, Peter Davies

Figure 1 for Distributed Mean Estimation with Optimal Error Bounds
Figure 2 for Distributed Mean Estimation with Optimal Error Bounds
Figure 3 for Distributed Mean Estimation with Optimal Error Bounds
Figure 4 for Distributed Mean Estimation with Optimal Error Bounds
Viaarxiv icon