Laura Dietz

Beyond Relevance: On the Relationship Between Retrieval and RAG Information Coverage

Mar 11, 2026
Via arXiv

Supporting Humans in Evaluating AI Summaries of Legal Depositions

Jan 21, 2026
Via arXiv

Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

Jan 19, 2026
Via arXiv

Incorporating Q&A Nuggets into Retrieval-Augmented Generation

Jan 19, 2026
Via arXiv

UNH at CheckThat! 2025: Fine-tuning Vs Prompting in Claim Extraction

Sep 08, 2025
Via arXiv

LLM-Evaluation Tropes: Perspectives on the Validity of LLM-Evaluations

Apr 27, 2025
Via arXiv

LLM-based relevance assessment still can't replace human relevance assessment

Dec 22, 2024
Via arXiv

Best in Tau@LLMJudge: Criteria-Based Relevance Evaluation with Llama3

Oct 17, 2024
Via arXiv

A Workbench for Autograding Retrieve/Generate Systems

May 21, 2024
Via arXiv

An Exam-based Evaluation Approach Beyond Traditional Relevance Judgments

Feb 01, 2024
Via arXiv