Structured Packing in LLM Training Improves Long Context Utilization

Jan 02, 2024
Konrad Staniszewski, Szymon Tworkowski, Sebastian Jaszczur, Henryk Michalewski, Łukasz Kuciński, Piotr Miłoś

View paper on arXiv