Shu Okabe

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data

Jan 25, 2026
Via arXiv

Low-Resource, High-Impact: Building Corpora for Inclusive Language Technologies

Dec 16, 2025
Via arXiv