Picture for Fabian Güra

Fabian Güra

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Add code
Apr 03, 2025
Figure 1 for ZClip: Adaptive Spike Mitigation for LLM Pre-Training
Figure 2 for ZClip: Adaptive Spike Mitigation for LLM Pre-Training
Figure 3 for ZClip: Adaptive Spike Mitigation for LLM Pre-Training
Figure 4 for ZClip: Adaptive Spike Mitigation for LLM Pre-Training
Viaarxiv icon

A Refined Analysis of Massive Activations in LLMs

Add code
Mar 28, 2025
Figure 1 for A Refined Analysis of Massive Activations in LLMs
Figure 2 for A Refined Analysis of Massive Activations in LLMs
Figure 3 for A Refined Analysis of Massive Activations in LLMs
Figure 4 for A Refined Analysis of Massive Activations in LLMs
Viaarxiv icon