Alert button

Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing

Add code
Bookmark button
Alert button
Feb 25, 2024
Jiabao Ji, Bairu Hou, Alexander Robey, George J. Pappas, Hamed Hassani, Yang Zhang, Eric Wong, Shiyu Chang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: