Picture for Teven Le Scao

Teven Le Scao

Magistral

Add code
Jun 12, 2025
Viaarxiv icon

Pixtral 12B

Add code
Oct 09, 2024
Figure 1 for Pixtral 12B
Figure 2 for Pixtral 12B
Figure 3 for Pixtral 12B
Figure 4 for Pixtral 12B
Viaarxiv icon

Mixtral of Experts

Add code
Jan 08, 2024
Figure 1 for Mixtral of Experts
Figure 2 for Mixtral of Experts
Figure 3 for Mixtral of Experts
Figure 4 for Mixtral of Experts
Viaarxiv icon

FinGPT: Large Generative Models for a Small Language

Add code
Nov 03, 2023
Figure 1 for FinGPT: Large Generative Models for a Small Language
Figure 2 for FinGPT: Large Generative Models for a Small Language
Figure 3 for FinGPT: Large Generative Models for a Small Language
Figure 4 for FinGPT: Large Generative Models for a Small Language
Viaarxiv icon

Mistral 7B

Add code
Oct 10, 2023
Figure 1 for Mistral 7B
Figure 2 for Mistral 7B
Figure 3 for Mistral 7B
Figure 4 for Mistral 7B
Viaarxiv icon

Scaling Data-Constrained Language Models

Add code
May 25, 2023
Viaarxiv icon

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Add code
Mar 07, 2023
Figure 1 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 2 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 3 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Figure 4 for The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
Viaarxiv icon

Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation

Add code
Feb 28, 2023
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

What Language Model to Train if You Have One Million GPU Hours?

Add code
Nov 08, 2022
Viaarxiv icon