Shivani Upadhyay

Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges

Apr 21, 2025
Via arXiv

The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language Models

Apr 21, 2025
Via arXiv

Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework

Nov 14, 2024
Via arXiv

A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look

Nov 13, 2024
Via arXiv

UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor

Jun 10, 2024
Via arXiv

UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models

May 16, 2024
Via arXiv

LLMs Can Patch Up Missing Relevance Judgments in Evaluation

May 08, 2024
Via arXiv