Picture for Eduardo Sánchez

Eduardo Sánchez

BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation

Add code
Feb 06, 2025
Figure 1 for BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
Figure 2 for BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
Figure 3 for BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
Figure 4 for BOUQuET: dataset, Benchmark and Open initiative for Universal Quality Evaluation in Translation
Viaarxiv icon

LCFO: Long Context and Long Form Output Dataset and Benchmarking

Add code
Dec 12, 2024
Figure 1 for LCFO: Long Context and Long Form Output Dataset and Benchmarking
Figure 2 for LCFO: Long Context and Long Form Output Dataset and Benchmarking
Figure 3 for LCFO: Long Context and Long Form Output Dataset and Benchmarking
Figure 4 for LCFO: Long Context and Long Form Output Dataset and Benchmarking
Viaarxiv icon

Large Concept Models: Language Modeling in a Sentence Representation Space

Add code
Dec 11, 2024
Figure 1 for Large Concept Models: Language Modeling in a Sentence Representation Space
Figure 2 for Large Concept Models: Language Modeling in a Sentence Representation Space
Figure 3 for Large Concept Models: Language Modeling in a Sentence Representation Space
Figure 4 for Large Concept Models: Language Modeling in a Sentence Representation Space
Viaarxiv icon

Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation

Add code
Dec 11, 2024
Figure 1 for Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation
Figure 2 for Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation
Figure 3 for Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation
Figure 4 for Y-NQ: English-Yorùbá Evaluation dataset for Open-Book Reading Comprehension and Text Generation
Viaarxiv icon

On the Role of Speech Data in Reducing Toxicity Detection Bias

Add code
Nov 12, 2024
Figure 1 for On the Role of Speech Data in Reducing Toxicity Detection Bias
Figure 2 for On the Role of Speech Data in Reducing Toxicity Detection Bias
Figure 3 for On the Role of Speech Data in Reducing Toxicity Detection Bias
Figure 4 for On the Role of Speech Data in Reducing Toxicity Detection Bias
Viaarxiv icon

Linguini: A benchmark for language-agnostic linguistic reasoning

Add code
Sep 18, 2024
Figure 1 for Linguini: A benchmark for language-agnostic linguistic reasoning
Figure 2 for Linguini: A benchmark for language-agnostic linguistic reasoning
Figure 3 for Linguini: A benchmark for language-agnostic linguistic reasoning
Figure 4 for Linguini: A benchmark for language-agnostic linguistic reasoning
Viaarxiv icon

Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models

Add code
Jul 25, 2024
Viaarxiv icon

Gender-specific Machine Translation with Large Language Models

Add code
Sep 06, 2023
Figure 1 for Gender-specific Machine Translation with Large Language Models
Figure 2 for Gender-specific Machine Translation with Large Language Models
Figure 3 for Gender-specific Machine Translation with Large Language Models
Figure 4 for Gender-specific Machine Translation with Large Language Models
Viaarxiv icon