Nay Myat Min

Propaganda via AI? A Study on Semantic Backdoors in Large Language Models

Apr 15, 2025
Via arXiv

CROW: Eliminating Backdoors from Large Language Models via Internal Consistency Regularization

Nov 18, 2024
Via arXiv

Unified Neural Backdoor Removal with Only Few Clean Samples through Unlearning and Relearning

May 23, 2024
Via arXiv