Picture for Benoît Sagot

Benoît Sagot

ALMAnaCH

Reading or Guessing? Visual Grounding Failures of Vision-Language Models for OCR in Ancient Greek Editions

Add code
May 26, 2026
Viaarxiv icon

Testing the Deliteralization Hypothesis in Human and Machine Translation

Add code
May 25, 2026
Viaarxiv icon

Language-Switching Triggers Take a Latent Detour Through Language Models

Add code
May 18, 2026
Viaarxiv icon

Hindsight Quality Prediction Experiments in Multi-Candidate Human-Post-Edited Machine Translation

Add code
Mar 04, 2026
Viaarxiv icon

Structure-Aware Text Recognition for Ancient Greek Critical Editions

Add code
Mar 03, 2026
Viaarxiv icon

How Should We Model the Probability of a Language?

Add code
Feb 09, 2026
Viaarxiv icon

Disentangling meaning from language in LLM-based machine translation

Add code
Feb 04, 2026
Viaarxiv icon

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data

Add code
Jan 25, 2026
Viaarxiv icon

When the Gold Standard isn't Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content

Add code
Dec 19, 2025
Figure 1 for When the Gold Standard isn't Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content
Figure 2 for When the Gold Standard isn't Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content
Figure 3 for When the Gold Standard isn't Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content
Figure 4 for When the Gold Standard isn't Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content
Viaarxiv icon

Gaperon: A Peppered English-French Generative Language Model Suite

Add code
Oct 29, 2025
Viaarxiv icon