Alert button

Learn to Disguise: Avoid Refusal Responses in LLM's Defense via a Multi-agent Attacker-Disguiser Game

Apr 03, 2024
Qianqiao Xu, Zhiliang Tian, Hongyan Wu, Zhen Huang, Yiping Song, Feng Liu, Dongsheng Li

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: