Picture for Hannah Liu

Hannah Liu

OasisSimp: An Open-source Asian-English Sentence Simplification Dataset

Add code
Mar 14, 2026
Viaarxiv icon

SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects

Add code
Sep 14, 2023
Figure 1 for SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
Figure 2 for SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
Figure 3 for SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
Figure 4 for SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
Viaarxiv icon