Bingni Zhang

Exploring Polyglot Harmony: On Multilingual Data Allocation for Large Language Models Pretraining

Sep 19, 2025, via arXiv

MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

Jul 02, 2025, via arXiv

MuBench: Assessment of Multilingual Capabilities of Large Language Models Across 61 Languages

Jun 24, 2025, via arXiv

Frame-Voyager: Learning to Query Frames for Video Large Language Models

Oct 07, 2024, via arXiv