Alert button

Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks

Add code
Bookmark button
Alert button
Oct 19, 2023
Xiaodong Yu, Hao Cheng, Xiaodong Liu, Dan Roth, Jianfeng Gao

Figure 1 for Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks
Figure 2 for Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks
Figure 3 for Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks
Figure 4 for Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: