Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content

Feb 21, 2024
Federico Bianchi, James Zou

View paper on arXiv
