Picture for Jiandong Shao

Jiandong Shao

The Role of Mixed-Language Documents for Multilingual Large Language Model Pretraining

Add code
Jan 01, 2026
Viaarxiv icon