Picture for Denis Kuznedelev

Denis Kuznedelev

EvoPress: Towards Optimal Dynamic Model Compression via Evolutionary Search

Add code
Oct 18, 2024
Viaarxiv icon

Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization

Add code
Aug 31, 2024
Viaarxiv icon

The Iterative Optimal Brain Surgeon: Faster Sparse Recovery by Leveraging Second-Order Information

Add code
Aug 30, 2024
Viaarxiv icon

Does Diffusion Beat GAN in Image Super Resolution?

Add code
May 27, 2024
Viaarxiv icon

PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression

Add code
May 23, 2024
Viaarxiv icon

YaART: Yet Another ART Rendering Technology

Add code
Apr 08, 2024
Viaarxiv icon

Extreme Compression of Large Language Models via Additive Quantization

Add code
Jan 11, 2024
Viaarxiv icon

Sparse Fine-tuning for Inference Acceleration of Large Language Models

Add code
Oct 13, 2023
Viaarxiv icon

Accurate Neural Network Pruning Requires Rethinking Sparse Optimization

Add code
Aug 03, 2023
Viaarxiv icon

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression

Add code
Jun 05, 2023
Viaarxiv icon