Picture for Fabian Güra

Fabian Güra

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Add code
Apr 03, 2025
Viaarxiv icon

A Refined Analysis of Massive Activations in LLMs

Add code
Mar 28, 2025
Viaarxiv icon