Bruno Lopes Yamamoto

Efficient LLMs with AMP: Attention Heads and MLP Pruning

Apr 29, 2025
Via arXiv