Picture for Dan Alistarh

Dan Alistarh

Apertus LLM Family Expansion via Distillation and Quantization

Add code
May 27, 2026
Viaarxiv icon

Grid Games: The Power of Multiple Grids for Quantizing Large Language Models

Add code
May 12, 2026
Viaarxiv icon

Statistically-Lossless Quantization of Large Language Models

Add code
May 04, 2026
Viaarxiv icon

GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling

Add code
Apr 20, 2026
Viaarxiv icon

Towards Robust Scaling Laws for Optimizers

Add code
Feb 07, 2026
Viaarxiv icon

LoRDO: Distributed Low-Rank Optimization with Infrequent Communication

Add code
Feb 04, 2026
Viaarxiv icon

MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization

Add code
Feb 03, 2026
Viaarxiv icon

DASH: Faster Shampoo via Batched Block Preconditioning and Efficient Inverse-Root Solvers

Add code
Feb 02, 2026
Viaarxiv icon

Quartet II: Accurate LLM Pre-Training in NVFP4 by Improved Unbiased Gradient Estimation

Add code
Jan 30, 2026
Viaarxiv icon

Behemoth: Benchmarking Unlearning in LLMs Using Fully Synthetic Data

Add code
Jan 30, 2026
Viaarxiv icon