Alert button

JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models

Add code
Bookmark button
Alert button
Mar 28, 2024
Patrick Chao, Edoardo Debenedetti, Alexander Robey, Maksym Andriushchenko, Francesco Croce, Vikash Sehwag, Edgar Dobriban, Nicolas Flammarion, George J. Pappas, Florian Tramer, Hamed Hassani, Eric Wong

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: