Picture for Alexandre Van Tassel

Alexandre Van Tassel

Dispersion Loss Counteracts Embedding Condensation and Improves Generalization in Small Language Models

Add code
Jan 30, 2026
Viaarxiv icon