Roberto Navigli

Has Machine Translation Evaluation Achieved Human Parity? The Human Reference and the Limits of Progress

Jun 24, 2025

Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation

Apr 23, 2025

Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering

Mar 19, 2025

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Dec 19, 2024

Word Sense Linking: Disambiguating Outside the Sandbox

Dec 12, 2024

Truth or Mirage? Towards End-to-End Factuality Evaluation with LLM-Oasis

Dec 02, 2024

Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and Beyond

Oct 08, 2024

ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering

Oct 07, 2024

Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics

Oct 07, 2024

Guardians of the Machine Translation Meta-Evaluation: Sentinel Metrics Fall In!

Aug 25, 2024