Alert button

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Apr 11, 2024
Nathan Godey, Éric de la Clergerie, Benoît Sagot

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: