Uncovering and Aligning Anomalous Attention Heads to Defend Against NLP Backdoor Attacks

Add code
Nov 16, 2025
Figure 1 for Uncovering and Aligning Anomalous Attention Heads to Defend Against NLP Backdoor Attacks
Figure 2 for Uncovering and Aligning Anomalous Attention Heads to Defend Against NLP Backdoor Attacks
Figure 3 for Uncovering and Aligning Anomalous Attention Heads to Defend Against NLP Backdoor Attacks
Figure 4 for Uncovering and Aligning Anomalous Attention Heads to Defend Against NLP Backdoor Attacks

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: