Muhammad Abdul-Mageed

University of British Columbia

Alexandria: A Multi-Domain Dialectal Arabic Machine Translation Dataset for Culturally Inclusive and Linguistically Diverse LLMs

Jan 19, 2026

AfroScope: A Framework for Studying the Linguistic Landscape of Africa

Jan 19, 2026

Arab Voices: Mapping Standard and Dialectal Arabic Speech Technology

Jan 19, 2026

TEMPO: A Realistic Multi-Domain Benchmark for Temporal Reasoning-Intensive Retrieval

Jan 14, 2026

Reflection Pretraining Enables Token-Level Self-Correction in Biological Sequence Models

Dec 24, 2025

Distilling the Essence: Efficient Reasoning Distillation via Sequence Truncation

Dec 24, 2025

AraHealthQA 2025: The First Shared Task on Arabic Health Question Answering

Aug 28, 2025

Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset

May 28, 2025

Voice of a Continent: Mapping Africa's Speech Technology Frontier

May 24, 2025

NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities

May 23, 2025