Picture for Cyril Moineau

Cyril Moineau

Precision Where It Matters: A Novel Spike Aware Mixed-Precision Quantization Strategy for LLaMA-based Language Models

Add code
Apr 30, 2025
Viaarxiv icon

Gradual Binary Search and Dimension Expansion : A general method for activation quantization in LLMs

Add code
Apr 18, 2025
Viaarxiv icon