Alert button
Picture for Dan Alistarh

Dan Alistarh

Alert button

RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation

Jan 12, 2024
Mahdi Nikdan, Soroush Tabesh, Dan Alistarh

Viaarxiv icon

Extreme Compression of Large Language Models via Additive Quantization

Jan 11, 2024
Vage Egiazarian, Andrei Panferov, Denis Kuznedelev, Elias Frantar, Artem Babenko, Dan Alistarh

Viaarxiv icon

How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry'' Benchmark

Dec 21, 2023
Eldar Kurtic, Torsten Hoefler, Dan Alistarh

Viaarxiv icon

ELSA: Partial Weight Freezing for Overhead-Free Sparse Network Deployment

Dec 17, 2023
Paniz Halvachi, Alexandra Peste, Dan Alistarh, Christoph H. Lampert

Viaarxiv icon

AsGrad: A Sharp Unified Analysis of Asynchronous-SGD Algorithms

Oct 31, 2023
Rustem Islamov, Mher Safaryan, Dan Alistarh

Viaarxiv icon

QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models

Oct 25, 2023
Elias Frantar, Dan Alistarh

Viaarxiv icon

Towards End-to-end 4-Bit Inference on Generative Large Language Models

Oct 13, 2023
Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh

Viaarxiv icon

Sparse Fine-tuning for Inference Acceleration of Large Language Models

Oct 13, 2023
Eldar Kurtic, Denis Kuznedelev, Elias Frantar, Michael Goin, Dan Alistarh

Viaarxiv icon

Sparse Finetuning for Inference Acceleration of Large Language Models

Oct 10, 2023
Eldar Kurtic, Denis Kuznedelev, Elias Frantar, Michael Goin, Dan Alistarh

Viaarxiv icon

SPADE: Sparsity-Guided Debugging for Deep Neural Networks

Oct 06, 2023
Arshia Soltani Moakhar, Eugenia Iofinova, Dan Alistarh

Figure 1 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Figure 2 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Figure 3 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Figure 4 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Viaarxiv icon