Alert button

Critical Data Size of Language Models from a Grokking Perspective

Jan 19, 2024
Xuekai Zhu, Yao Fu, Bowen Zhou, Zhouhan Lin

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: