Benoît Sagot

ALMAnaCH

TopXGen: Topic-Diverse Parallel Data Generation for Low-Resource Machine Translation

Aug 12, 2025
Via arXiv

ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance

Apr 11, 2025
Via arXiv

Explicit Learning and the LLM in Machine Translation

Mar 12, 2025
Via arXiv

KréyoLID From Language Identification Towards Language Mining

Mar 09, 2025
Via arXiv

Compositional Translation: A Novel LLM-based Approach for Low-resource Machine Translation

Mar 06, 2025
Via arXiv

Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression

Mar 04, 2025
Via arXiv

Diachronic Document Dataset for Semantic Layout Analysis

Nov 15, 2024
Via arXiv

CamemBERT 2.0: A Smarter French Language Model Aged to Perfection

Nov 13, 2024
Via arXiv

Tree of Problems: Improving structured problem solving with compositionality

Oct 09, 2024
Via arXiv

Molyé: A Corpus-based Approach to Language Contact in Colonial France

Aug 08, 2024
Via arXiv