Picture for Lucas Lauton de Alcantara

Lucas Lauton de Alcantara

Efficient LLMs with AMP: Attention Heads and MLP Pruning

Add code
Apr 29, 2025
Viaarxiv icon