Picture for Sergey Edunov

Sergey Edunov

Jack

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks

Add code
Feb 24, 2025
Viaarxiv icon

Law of the Weakest Link: Cross Capabilities of Large Language Models

Add code
Sep 30, 2024
Figure 1 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Figure 2 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Figure 3 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Figure 4 for Law of the Weakest Link: Cross Capabilities of Large Language Models
Viaarxiv icon

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

Effective Long-Context Scaling of Foundation Models

Add code
Sep 27, 2023
Figure 1 for Effective Long-Context Scaling of Foundation Models
Figure 2 for Effective Long-Context Scaling of Foundation Models
Figure 3 for Effective Long-Context Scaling of Foundation Models
Figure 4 for Effective Long-Context Scaling of Foundation Models
Viaarxiv icon

Llama 2: Open Foundation and Fine-Tuned Chat Models

Add code
Jul 19, 2023
Figure 1 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 2 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 3 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Figure 4 for Llama 2: Open Foundation and Fine-Tuned Chat Models
Viaarxiv icon

No Language Left Behind: Scaling Human-Centered Machine Translation

Add code
Jul 11, 2022
Figure 1 for No Language Left Behind: Scaling Human-Centered Machine Translation
Figure 2 for No Language Left Behind: Scaling Human-Centered Machine Translation
Figure 3 for No Language Left Behind: Scaling Human-Centered Machine Translation
Figure 4 for No Language Left Behind: Scaling Human-Centered Machine Translation
Viaarxiv icon

LegoNN: Building Modular Encoder-Decoder Models

Add code
Jun 07, 2022
Figure 1 for LegoNN: Building Modular Encoder-Decoder Models
Figure 2 for LegoNN: Building Modular Encoder-Decoder Models
Figure 3 for LegoNN: Building Modular Encoder-Decoder Models
Figure 4 for LegoNN: Building Modular Encoder-Decoder Models
Viaarxiv icon

Facebook AI WMT21 News Translation Task Submission

Add code
Aug 06, 2021
Figure 1 for Facebook AI WMT21 News Translation Task Submission
Figure 2 for Facebook AI WMT21 News Translation Task Submission
Figure 3 for Facebook AI WMT21 News Translation Task Submission
Figure 4 for Facebook AI WMT21 News Translation Task Submission
Viaarxiv icon

A Comparison of Approaches to Document-level Machine Translation

Add code
Jan 26, 2021
Figure 1 for A Comparison of Approaches to Document-level Machine Translation
Figure 2 for A Comparison of Approaches to Document-level Machine Translation
Figure 3 for A Comparison of Approaches to Document-level Machine Translation
Figure 4 for A Comparison of Approaches to Document-level Machine Translation
Viaarxiv icon