Title: The effectiveness of feature attribution methods and its correlation with automatic evaluation scores

Jun 23, 2021

Giang Nguyen, Daeyoung Kim, Anh Nguyen

Abstract: Explaining the decisions of an Artificial Intelligence (AI) model is increasingly critical in many real-world, high-stakes applications. Hundreds of papers have proposed new feature attribution methods, or discussed or harnessed these tools in their work. However, despite humans being the target end-users, most attribution methods have been evaluated only on proxy automatic-evaluation metrics. In this paper, we conduct the first large-scale user study, with 320 lay and 11 expert users, to shed light on the effectiveness of state-of-the-art attribution methods in assisting humans in ImageNet classification, Stanford Dogs fine-grained classification, and the same two tasks when the input image contains adversarial perturbations. We found that, overall, feature attribution is surprisingly not more effective than showing humans nearest training-set examples. On a hard task of fine-grained dog categorization, presenting attribution maps to humans does not help, but instead hurts the performance of human-AI teams compared to AI alone. Importantly, we found that automatic attribution-map evaluation measures correlate poorly with actual human-AI team performance. Our findings encourage the community to rigorously test their methods on downstream human-in-the-loop applications and to rethink the existing evaluation metrics.

View paper on OpenReview
