Alert button

Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure

Add code
Bookmark button
Alert button
Nov 27, 2023
Jérémy Scheurer, Mikita Balesni, Marius Hobbhahn

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: