Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models

Feb 08, 2024
Chirag Agarwal, Sree Harsha Tanneru, Himabindu Lakkaraju
