How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

Sep 26, 2023
Lorenzo Pacchiardi, Alex J. Chan, Sören Mindermann, Ilan Moscovitz, Alexa Y. Pan, Yarin Gal, Owain Evans, Jan Brauner
