Alert button

Curiosity-driven Red-teaming for Large Language Models

Add code
Bookmark button
Alert button
Feb 29, 2024
Zhang-Wei Hong, Idan Shenfeld, Tsun-Hsuan Wang, Yung-Sung Chuang, Aldo Pareja, James Glass, Akash Srivastava, Pulkit Agrawal

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: