Picture for Djamé Seddah

Djamé Seddah

Gaperon: A Peppered English-French Generative Language Model Suite

Add code
Oct 29, 2025
Viaarxiv icon

ModernBERT or DeBERTaV3? Examining Architecture and Data Influence on Transformer Encoder Models Performance

Add code
Apr 11, 2025
Viaarxiv icon

Common Ground, Diverse Roots: The Difficulty of Classifying Common Examples in Spanish Varieties

Add code
Dec 16, 2024
Viaarxiv icon

Beyond Dataset Creation: Critical View of Annotation Variation and Bias Probing of a Dataset for Online Radical Content Detection

Add code
Dec 16, 2024
Viaarxiv icon

CamemBERT 2.0: A Smarter French Language Model Aged to Perfection

Add code
Nov 13, 2024
Viaarxiv icon

Cloaked Classifiers: Pseudonymization Strategies on Sensitive Classification Tasks

Add code
Jun 25, 2024
Figure 1 for Cloaked Classifiers: Pseudonymization Strategies on Sensitive Classification Tasks
Figure 2 for Cloaked Classifiers: Pseudonymization Strategies on Sensitive Classification Tasks
Figure 3 for Cloaked Classifiers: Pseudonymization Strategies on Sensitive Classification Tasks
Figure 4 for Cloaked Classifiers: Pseudonymization Strategies on Sensitive Classification Tasks
Viaarxiv icon

From Text to Source: Results in Detecting Large Language Model-Generated Content

Add code
Sep 23, 2023
Viaarxiv icon

Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?

Add code
Jun 09, 2023
Figure 1 for Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?
Figure 2 for Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?
Figure 3 for Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?
Viaarxiv icon

Data-Efficient French Language Modeling with CamemBERTa

Add code
Jun 02, 2023
Viaarxiv icon

Multilingual Auxiliary Tasks Training: Bridging the Gap between Languages for Zero-Shot Transfer of Hate Speech Detection Models

Add code
Oct 25, 2022
Viaarxiv icon