Picture for Muhammad Abdul-Mageed

Muhammad Abdul-Mageed

University of British Columbia

Pearl: A Multimodal Culturally-Aware Arabic Instruction Dataset

Add code
May 28, 2025
Viaarxiv icon

Voice of a Continent: Mapping Africa's Speech Technology Frontier

Add code
May 24, 2025
Viaarxiv icon

NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities

Add code
May 23, 2025
Viaarxiv icon

Where Are We? Evaluating LLM Performance on African Languages

Add code
Feb 26, 2025
Viaarxiv icon

Cross-Modal Consistency in Multimodal Large Language Models

Add code
Nov 14, 2024
Viaarxiv icon

Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks

Add code
Nov 02, 2024
Viaarxiv icon

Desert Camels and Oil Sheikhs: Arab-Centric Red Teaming of Frontier LLMs

Add code
Oct 31, 2024
Viaarxiv icon

Gazelle: An Instruction Dataset for Arabic Writing Assistance

Add code
Oct 23, 2024
Figure 1 for Gazelle: An Instruction Dataset for Arabic Writing Assistance
Figure 2 for Gazelle: An Instruction Dataset for Arabic Writing Assistance
Figure 3 for Gazelle: An Instruction Dataset for Arabic Writing Assistance
Figure 4 for Gazelle: An Instruction Dataset for Arabic Writing Assistance
Viaarxiv icon

Effective Self-Mining of In-Context Examples for Unsupervised Machine Translation with LLMs

Add code
Oct 14, 2024
Viaarxiv icon

Casablanca: Data and Models for Multidialectal Arabic Speech Recognition

Add code
Oct 06, 2024
Figure 1 for Casablanca: Data and Models for Multidialectal Arabic Speech Recognition
Figure 2 for Casablanca: Data and Models for Multidialectal Arabic Speech Recognition
Figure 3 for Casablanca: Data and Models for Multidialectal Arabic Speech Recognition
Figure 4 for Casablanca: Data and Models for Multidialectal Arabic Speech Recognition
Viaarxiv icon