Picture for Giada Cosenza

Giada Cosenza

Are Large Language Models Really Bias-Free? Jailbreak Prompts for Assessing Adversarial Robustness to Bias Elicitation

Add code
Jul 11, 2024
Viaarxiv icon