Picture for Graeme Nail

Graeme Nail

Jack

The Llama 3 Herd of Models

Add code
Jul 31, 2024
Viaarxiv icon

A New Massive Multilingual Dataset for High-Performance Language Technologies

Add code
Mar 20, 2024
Figure 1 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Figure 2 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Figure 3 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Figure 4 for A New Massive Multilingual Dataset for High-Performance Language Technologies
Viaarxiv icon

OpusCleaner and OpusTrainer, open source toolkits for training Machine Translation and Large language models

Add code
Nov 24, 2023
Viaarxiv icon