Picture for Emine Yilmaz

Emine Yilmaz

Towards Understanding Bias in Synthetic Data for Evaluation

Add code
Jun 12, 2025
Viaarxiv icon

PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants

Add code
Jun 11, 2025
Viaarxiv icon

Attributing Response to Context: A Jensen-Shannon Divergence Driven Mechanistic Study of Context Attribution in Retrieval-Augmented Generation

Add code
May 22, 2025
Viaarxiv icon

Judging the Judges: A Collection of LLM-Generated Relevance Judgements

Add code
Feb 19, 2025
Viaarxiv icon

KEIR @ ECIR 2025: The Second Workshop on Knowledge-Enhanced Information Retrieval

Add code
Jan 20, 2025
Viaarxiv icon

JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment

Add code
Dec 17, 2024
Figure 1 for JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
Figure 2 for JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
Figure 3 for JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
Figure 4 for JudgeBlender: Ensembling Judgments for Automatic Relevance Assessment
Viaarxiv icon

SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval

Add code
Aug 30, 2024
Figure 1 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Figure 2 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Figure 3 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Figure 4 for SynDL: A Large-Scale Synthetic Test Collection for Passage Retrieval
Viaarxiv icon

Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024

Add code
Aug 09, 2024
Figure 1 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Figure 2 for Report on the 1st Workshop on Large Language Model for Evaluation in Information Retrieval (LLM4Eval 2024) at SIGIR 2024
Viaarxiv icon

LLMJudge: LLMs for Relevance Judgments

Add code
Aug 09, 2024
Figure 1 for LLMJudge: LLMs for Relevance Judgments
Figure 2 for LLMJudge: LLMs for Relevance Judgments
Viaarxiv icon

Adaptive Retrieval-Augmented Generation for Conversational Systems

Add code
Jul 31, 2024
Viaarxiv icon