Picture for Benoît Sagot

Benoît Sagot

ALMAnaCH

Towards Zero-Shot Multimodal Machine Translation

Add code
Jul 18, 2024
Viaarxiv icon

mOSCAR: A Large-scale Multilingual and Multimodal Document-level Corpus

Add code
Jun 13, 2024
Viaarxiv icon

PatentEval: Understanding Errors in Patent Generation

Add code
Jun 05, 2024
Viaarxiv icon

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Add code
Apr 11, 2024
Figure 1 for Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck
Figure 2 for Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck
Figure 3 for Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck
Figure 4 for Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck
Viaarxiv icon

Making Sentence Embeddings Robust to User-Generated Content

Add code
Mar 25, 2024
Figure 1 for Making Sentence Embeddings Robust to User-Generated Content
Figure 2 for Making Sentence Embeddings Robust to User-Generated Content
Figure 3 for Making Sentence Embeddings Robust to User-Generated Content
Figure 4 for Making Sentence Embeddings Robust to User-Generated Content
Viaarxiv icon

On the Scaling Laws of Geographical Representation in Language Models

Add code
Mar 04, 2024
Viaarxiv icon

Anisotropy Is Inherent to Self-Attention in Transformers

Add code
Jan 24, 2024
Viaarxiv icon

Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer

Add code
Oct 05, 2023
Figure 1 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Figure 2 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Figure 3 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Figure 4 for Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer
Viaarxiv icon

From Text to Source: Results in Detecting Large Language Model-Generated Content

Add code
Sep 23, 2023
Figure 1 for From Text to Source: Results in Detecting Large Language Model-Generated Content
Figure 2 for From Text to Source: Results in Detecting Large Language Model-Generated Content
Figure 3 for From Text to Source: Results in Detecting Large Language Model-Generated Content
Figure 4 for From Text to Source: Results in Detecting Large Language Model-Generated Content
Viaarxiv icon

Headless Language Models: Learning without Predicting with Contrastive Weight Tying

Add code
Sep 15, 2023
Figure 1 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Figure 2 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Figure 3 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Figure 4 for Headless Language Models: Learning without Predicting with Contrastive Weight Tying
Viaarxiv icon