Alert button
Picture for Éric de la Clergerie

Éric de la Clergerie

Alert button

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Add code
Bookmark button
Alert button
Apr 11, 2024
Nathan Godey, Éric de la Clergerie, Benoît Sagot

Viaarxiv icon

On the Scaling Laws of Geographical Representation in Language Models

Add code
Bookmark button
Alert button
Mar 04, 2024
Nathan Godey, Éric de la Clergerie, Benoît Sagot

Viaarxiv icon

Anisotropy Is Inherent to Self-Attention in Transformers

Add code
Bookmark button
Alert button
Jan 24, 2024
Nathan Godey, Éric de la Clergerie, Benoît Sagot

Viaarxiv icon

Headless Language Models: Learning without Predicting with Contrastive Weight Tying

Add code
Bookmark button
Alert button
Sep 15, 2023
Nathan Godey, Éric de la Clergerie, Benoît Sagot

Figure 1 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Figure 2 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Figure 3 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Figure 4 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Viaarxiv icon

Is Anisotropy Inherent to Transformers?

Add code
Bookmark button
Alert button
Jun 13, 2023
Nathan Godey, Éric de la Clergerie, Benoît Sagot

Figure 1 for Is Anisotropy Inherent to Transformers?
Figure 2 for Is Anisotropy Inherent to Transformers?
Figure 3 for Is Anisotropy Inherent to Transformers?
Figure 4 for Is Anisotropy Inherent to Transformers?
Viaarxiv icon

MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling

Add code
Bookmark button
Alert button
Dec 14, 2022
Nathan Godey, Roman Castagné, Éric de la Clergerie, Benoît Sagot

Figure 1 for MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling
Figure 2 for MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling
Figure 3 for MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling
Figure 4 for MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling
Viaarxiv icon

Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts

Add code
Bookmark button
Alert button
Dec 07, 2020
Fuqi Song, Éric de la Clergerie

Figure 1 for Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Figure 2 for Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Figure 3 for Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Figure 4 for Clustering-based Automatic Construction of Legal Entity Knowledge Base from Contracts
Viaarxiv icon

Multilingual Unsupervised Sentence Simplification

Add code
Bookmark button
Alert button
May 01, 2020
Louis Martin, Angela Fan, Éric de la Clergerie, Antoine Bordes, Benoît Sagot

Figure 1 for Multilingual Unsupervised Sentence Simplification
Figure 2 for Multilingual Unsupervised Sentence Simplification
Figure 3 for Multilingual Unsupervised Sentence Simplification
Figure 4 for Multilingual Unsupervised Sentence Simplification
Viaarxiv icon