Picture for Dan Alistarh

Dan Alistarh

Extreme Compression of Large Language Models via Additive Quantization

Add code
Jan 11, 2024
Viaarxiv icon

How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry'' Benchmark

Add code
Dec 21, 2023
Viaarxiv icon

ELSA: Partial Weight Freezing for Overhead-Free Sparse Network Deployment

Add code
Dec 17, 2023
Figure 1 for ELSA: Partial Weight Freezing for Overhead-Free Sparse Network Deployment
Figure 2 for ELSA: Partial Weight Freezing for Overhead-Free Sparse Network Deployment
Figure 3 for ELSA: Partial Weight Freezing for Overhead-Free Sparse Network Deployment
Figure 4 for ELSA: Partial Weight Freezing for Overhead-Free Sparse Network Deployment
Viaarxiv icon

AsGrad: A Sharp Unified Analysis of Asynchronous-SGD Algorithms

Add code
Oct 31, 2023
Viaarxiv icon

QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models

Add code
Oct 25, 2023
Viaarxiv icon

Sparse Fine-tuning for Inference Acceleration of Large Language Models

Add code
Oct 13, 2023
Viaarxiv icon

Towards End-to-end 4-Bit Inference on Generative Large Language Models

Add code
Oct 13, 2023
Figure 1 for Towards End-to-end 4-Bit Inference on Generative Large Language Models
Figure 2 for Towards End-to-end 4-Bit Inference on Generative Large Language Models
Figure 3 for Towards End-to-end 4-Bit Inference on Generative Large Language Models
Figure 4 for Towards End-to-end 4-Bit Inference on Generative Large Language Models
Viaarxiv icon

SPADE: Sparsity-Guided Debugging for Deep Neural Networks

Add code
Oct 06, 2023
Figure 1 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Figure 2 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Figure 3 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Figure 4 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Viaarxiv icon

Scaling Laws for Sparsely-Connected Foundation Models

Add code
Sep 15, 2023
Viaarxiv icon

Accurate Neural Network Pruning Requires Rethinking Sparse Optimization

Add code
Aug 03, 2023
Viaarxiv icon