Picture for Wenhan Han

Wenhan Han

MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

Add code
Jul 02, 2025
Viaarxiv icon

MuBench: Assessment of Multilingual Capabilities of Large Language Models Across 61 Languages

Add code
Jun 24, 2025
Viaarxiv icon

MedINST: Meta Dataset of Biomedical Instructions

Add code
Oct 17, 2024
Figure 1 for MedINST: Meta Dataset of Biomedical Instructions
Figure 2 for MedINST: Meta Dataset of Biomedical Instructions
Figure 3 for MedINST: Meta Dataset of Biomedical Instructions
Figure 4 for MedINST: Meta Dataset of Biomedical Instructions
Viaarxiv icon