Picture for Vilém Zouhar

Vilém Zouhar

Deconstructing Self-Bias in LLM-generated Translation Benchmarks

Add code
Sep 30, 2025
Viaarxiv icon

Searching for Difficult-to-Translate Test Examples at Scale

Add code
Sep 30, 2025
Viaarxiv icon

Generating Difficult-to-Translate Texts

Add code
Sep 30, 2025
Viaarxiv icon

Biased Tales: Cultural and Topic Bias in Generating Children's Stories

Add code
Sep 09, 2025
Viaarxiv icon

Estimating Machine Translation Difficulty

Add code
Aug 13, 2025
Viaarxiv icon

Can Large Language Models Capture Human Annotator Disagreements?

Add code
Jun 24, 2025
Viaarxiv icon

Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement

Add code
May 29, 2025
Viaarxiv icon

Multilingual Performance Biases of Large Language Models in Education

Add code
Apr 24, 2025
Viaarxiv icon

Large Language Models as Span Annotators

Add code
Apr 11, 2025
Viaarxiv icon

QE4PE: Word-level Quality Estimation for Human Post-Editing

Add code
Mar 04, 2025
Viaarxiv icon