Picture for Elliott Delaye

Elliott Delaye

MINCE: Shrinking LLM Evaluation Datasets via Few-Model Monte Carlo Calibration

Add code
Jun 22, 2026
Viaarxiv icon

Recover-LoRA for Aggressive Quantization: Reclaiming Accuracy in 2-Bit Language Models via Low-Rank Adaptation with Knowledge Distillation on Synthetic Data

Add code
Jun 02, 2026
Viaarxiv icon

Bring Your Own Codegen to Deep Learning Compiler

Add code
May 03, 2021
Figure 1 for Bring Your Own Codegen to Deep Learning Compiler
Figure 2 for Bring Your Own Codegen to Deep Learning Compiler
Figure 3 for Bring Your Own Codegen to Deep Learning Compiler
Figure 4 for Bring Your Own Codegen to Deep Learning Compiler
Viaarxiv icon

Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines

Add code
May 21, 2018
Figure 1 for Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines
Figure 2 for Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines
Figure 3 for Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines
Figure 4 for Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines
Viaarxiv icon