José Pombal

SEQUOR: A Multi-Turn Benchmark for Realistic Constraint Following

May 07, 2026

Self-Preference Bias in Rubric-Based Evaluation of Large Language Models

Apr 08, 2026

EuroLLM-22B: Technical Report

Feb 05, 2026

MindGuard: Guardrail Classifiers for Multi-Turn Mental Health Support

Feb 01, 2026

EuroLLM-9B: Technical Report

Jun 04, 2025

M-Prometheus: A Suite of Open Multilingual LLM Judges

Apr 07, 2025

Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models

Apr 01, 2025

Adding Chocolate to Mint: Mitigating Metric Interference in Machine Translation

Mar 11, 2025

A Context-aware Framework for Translation-mediated Conversations

Dec 05, 2024

EuroLLM: Multilingual Language Models for Europe

Sep 24, 2024