Picture for Orevaoghene Ahia

Orevaoghene Ahia

Frame-Level Internal Tool Use for Temporal Grounding in Audio LMs

Add code
Feb 10, 2026
Viaarxiv icon

BASS: Benchmarking Audio LMs for Musical Structure and Semantic Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

Broken Tokens? Your Language Model can Secretly Handle Non-Canonical Tokenizations

Add code
Jun 23, 2025
Viaarxiv icon

BLAB: Brutally Long Audio Bench

Add code
May 05, 2025
Viaarxiv icon

MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization

Add code
Jul 11, 2024
Figure 1 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Figure 2 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Figure 3 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Figure 4 for MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization
Viaarxiv icon

Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects

Add code
Jun 27, 2024
Figure 1 for Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Figure 2 for Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Figure 3 for Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Figure 4 for Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
Viaarxiv icon

Teaching LLMs to Abstain across Languages via Multilingual Feedback

Add code
Jun 22, 2024
Figure 1 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Figure 2 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Figure 3 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Figure 4 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Viaarxiv icon

Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning

Add code
May 29, 2024
Figure 1 for Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
Figure 2 for Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
Figure 3 for Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
Figure 4 for Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning
Viaarxiv icon

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

Add code
Mar 16, 2024
Figure 1 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Figure 2 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Figure 3 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Figure 4 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Viaarxiv icon

MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling

Add code
Mar 15, 2024
Viaarxiv icon