Alexander Fraser

Mask and You Shall Receive: Optimizing Masked Language Modeling For Pretraining BabyLMs

Oct 23, 2025
Via arXiv

EmoBench-UA: A Benchmark Dataset for Emotion Detection in Ukrainian

May 29, 2025
Via arXiv

NLP for Social Good: A Survey of Challenges, Opportunities, and Responsible Deployment

May 28, 2025
Via arXiv

EXECUTE: A Multilingual Benchmark for LLM Token Understanding

May 23, 2025
Via arXiv

Data-Efficient Hate Speech Detection via Cross-Lingual Nearest Neighbor Retrieval with Limited Labeled Data

May 20, 2025
Via arXiv

From Unaligned to Aligned: Scaling Multilingual LLMs with Multi-Way Parallel Corpora

May 20, 2025
Via arXiv

Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study

May 09, 2025
Via arXiv

DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection

Feb 17, 2025
Via arXiv

Beyond Literal Token Overlap: Token Alignability for Multilinguality

Feb 10, 2025
Via arXiv

Joint Localization and Activation Editing for Low-Resource Fine-Tuning

Feb 03, 2025
Via arXiv