Title: Fréchet Distance for Offline Evaluation of Information Retrieval Systems with Sparse Labels

Jan 31, 2024

Negar Arabzadeh, Charles L. A. Clarke

Abstract: The rapid advancement of natural language processing, information retrieval (IR), computer vision, and other technologies has presented significant challenges in evaluating the performance of these systems. One of the main challenges is the scarcity of human-labeled data, which hinders the fair and accurate assessment of these systems. In this work, we focus specifically on evaluating IR systems with sparse labels, borrowing from recent research on evaluating computer vision tasks and taking inspiration from the success of the Fréchet Inception Distance (FID) in assessing text-to-image generation systems. We propose leveraging the Fréchet Distance to measure the distance between the distributions of relevant judged items and retrieved results. Our experimental results on the MS MARCO V1 dataset and the TREC Deep Learning Track query sets demonstrate the effectiveness of the Fréchet Distance as a metric for evaluating IR systems, particularly in settings where only a few labels are available. This approach contributes to the advancement of evaluation methodologies in real-world scenarios such as the assessment of generative IR systems.
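
As a rough illustration of the idea in the abstract, the sketch below computes a Fréchet distance between two sets of embedding vectors by fitting a Gaussian to each set, which is how FID is conventionally computed. It is not taken from the paper: the function name `frechet_distance`, the use of NumPy, and the assumption that judged-relevant items and retrieved results have already been embedded by some encoder are all illustrative choices. The value returned is the squared form, d^2 = ||mu_x - mu_y||^2 + Tr(S_x + S_y - 2 (S_x S_y)^(1/2)).

```python
import numpy as np


def frechet_distance(x, y):
    """Squared Fréchet distance between Gaussians fitted to two embedding sets.

    x, y: arrays of shape (n_items, dim), e.g. embeddings of judged-relevant
    items and of retrieved results (hypothetical inputs, not from the paper).
    """
    x = np.asarray(x, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)

    # Fit a Gaussian to each set: mean vector and covariance matrix.
    mu_x, mu_y = x.mean(axis=0), y.mean(axis=0)
    cov_x = np.atleast_2d(np.cov(x, rowvar=False))
    cov_y = np.atleast_2d(np.cov(y, rowvar=False))

    # Tr((S_x S_y)^(1/2)) equals the sum of square roots of the eigenvalues
    # of S_x S_y, which are real and non-negative up to numerical noise.
    eigvals = np.linalg.eigvals(cov_x @ cov_y)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_x - mu_y
    return float(diff @ diff + np.trace(cov_x) + np.trace(cov_y) - 2.0 * tr_sqrt)
```

Under these assumptions, a lower value means the retrieved results are distributed more like the judged-relevant items; for example, `frechet_distance(relevant_emb, retrieved_emb)` with two hypothetical `(n, dim)` arrays could be compared across systems.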
