Chrysoula Zerva

Counterfactual Fairness with Graph Uncertainty

Jan 06, 2026
Via arXiv

Teaching Language Models to Faithfully Express their Uncertainty

Oct 14, 2025
Via arXiv

Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding

Oct 08, 2025
Via arXiv

GAMBIT+: A Challenge Set for Evaluating Gender Bias in Machine Translation Quality Estimation Metrics

Oct 08, 2025
Via arXiv

Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding

Jun 06, 2025
Via arXiv

A Conformal Risk Control Framework for Granular Word Assessment and Uncertainty Calibration of CLIPScore Quality Estimates

Apr 01, 2025
Via arXiv

Rejected Dialects: Biases Against African American Language in Reward Models

Feb 18, 2025
Via arXiv

Evaluation of Multilingual Image Captioning: How far can we get with CLIP models?

Feb 10, 2025
Via arXiv

"I Never Said That": A dataset, taxonomy and baselines on response clarity classification

Sep 20, 2024
Via arXiv

Conformal Prediction for Natural Language Processing: A Survey

May 03, 2024
Via arXiv