Picture for Sara Hooker

Sara Hooker

Nexus: Specialization meets Adaptability for Efficiently Training Mixture of Experts

Add code
Aug 28, 2024
Viaarxiv icon

Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress

Add code
Aug 27, 2024
Viaarxiv icon

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Add code
Aug 20, 2024
Viaarxiv icon

Consent in Crisis: The Rapid Decline of the AI Data Commons

Add code
Jul 24, 2024
Figure 1 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Figure 2 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Figure 3 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Figure 4 for Consent in Crisis: The Rapid Decline of the AI Data Commons
Viaarxiv icon

On the Limitations of Compute Thresholds as a Governance Strategy

Add code
Jul 08, 2024
Viaarxiv icon

How Does Quantization Affect Multilingual LLMs?

Add code
Jul 03, 2024
Figure 1 for How Does Quantization Affect Multilingual LLMs?
Figure 2 for How Does Quantization Affect Multilingual LLMs?
Figure 3 for How Does Quantization Affect Multilingual LLMs?
Figure 4 for How Does Quantization Affect Multilingual LLMs?
Viaarxiv icon

RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs

Add code
Jul 02, 2024
Figure 1 for RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs
Figure 2 for RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs
Figure 3 for RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs
Figure 4 for RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs
Viaarxiv icon

LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives

Add code
Jul 01, 2024
Figure 1 for LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives
Figure 2 for LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives
Figure 3 for LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives
Figure 4 for LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives
Viaarxiv icon

The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm

Add code
Jun 26, 2024
Viaarxiv icon

IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

Add code
Jun 05, 2024
Figure 1 for IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
Figure 2 for IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
Figure 3 for IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
Figure 4 for IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models
Viaarxiv icon