Alert button

Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models

Add code
Bookmark button
Alert button
Jul 17, 2023
Huachuan Qiu, Shuai Zhang, Anqi Li, Hongliang He, Zhenzhong Lan

Figure 1 for Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models
Figure 2 for Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models
Figure 3 for Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models
Figure 4 for Latent Jailbreak: A Benchmark for Evaluating Text Safety and Output Robustness of Large Language Models

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: