Picture for Maryruth Gathoni

Maryruth Gathoni

The Thiomi Dataset: A Large-Scale Multimodal Corpus for Low-Resource African Languages

Add code
Mar 31, 2026
Viaarxiv icon