Sebastian Ruder

MENLO: From Preferences to Proficiency -- Evaluating and Modeling Native-like Quality Across 47 Languages

Sep 30, 2025

Arbiters of Ambivalence: Challenges of Using LLMs in No-Consensus Tasks

May 28, 2025

The Sparse Frontier: Sparse Attention Trade-offs in Transformer LLMs

Apr 24, 2025

A Post-trainer's Guide to Multilingual Training Data: Uncovering Cross-lingual Transfer Dynamics

Apr 23, 2025

AL-QASIDA: Analyzing LLM Quality and Accuracy Systematically in Dialectal Arabic

Dec 05, 2024

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Oct 20, 2024

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

Aug 15, 2024

How Does Quantization Affect Multilingual LLMs?

Jul 03, 2024

LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives

Jul 01, 2024

Understanding and Mitigating Language Confusion in LLMs

Jun 28, 2024