Alert button

Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations

Nov 16, 2023
Wenjie Mo, Jiashu Xu, Qin Liu, Jiongxiao Wang, Jun Yan, Chaowei Xiao, Muhao Chen

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: