Alert button
Picture for Tinghao Xie

Tinghao Xie

Alert button

Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

Add code
Bookmark button
Alert button
Feb 07, 2024
Boyi Wei, Kaixuan Huang, Yangsibo Huang, Tinghao Xie, Xiangyu Qi, Mengzhou Xia, Prateek Mittal, Mengdi Wang, Peter Henderson

Viaarxiv icon

Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!

Add code
Bookmark button
Alert button
Oct 05, 2023
Xiangyu Qi, Yi Zeng, Tinghao Xie, Pin-Yu Chen, Ruoxi Jia, Prateek Mittal, Peter Henderson

Figure 1 for Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
Figure 2 for Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
Figure 3 for Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
Figure 4 for Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!
Viaarxiv icon

BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection

Add code
Bookmark button
Alert button
Aug 23, 2023
Tinghao Xie, Xiangyu Qi, Ping He, Yiming Li, Jiachen T. Wang, Prateek Mittal

Figure 1 for BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection
Figure 2 for BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection
Figure 3 for BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection
Figure 4 for BaDExpert: Extracting Backdoor Functionality for Accurate Backdoor Input Detection
Viaarxiv icon

Fight Poison with Poison: Detecting Backdoor Poison Samples via Decoupling Benign Correlations

Add code
Bookmark button
Alert button
May 26, 2022
Xiangyu Qi, Tinghao Xie, Saeed Mahloujifar, Prateek Mittal

Figure 1 for Fight Poison with Poison: Detecting Backdoor Poison Samples via Decoupling Benign Correlations
Figure 2 for Fight Poison with Poison: Detecting Backdoor Poison Samples via Decoupling Benign Correlations
Figure 3 for Fight Poison with Poison: Detecting Backdoor Poison Samples via Decoupling Benign Correlations
Figure 4 for Fight Poison with Poison: Detecting Backdoor Poison Samples via Decoupling Benign Correlations
Viaarxiv icon

Circumventing Backdoor Defenses That Are Based on Latent Separability

Add code
Bookmark button
Alert button
May 26, 2022
Xiangyu Qi, Tinghao Xie, Saeed Mahloujifar, Prateek Mittal

Figure 1 for Circumventing Backdoor Defenses That Are Based on Latent Separability
Figure 2 for Circumventing Backdoor Defenses That Are Based on Latent Separability
Figure 3 for Circumventing Backdoor Defenses That Are Based on Latent Separability
Figure 4 for Circumventing Backdoor Defenses That Are Based on Latent Separability
Viaarxiv icon

Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks

Add code
Bookmark button
Alert button
Nov 25, 2021
Xiangyu Qi, Tinghao Xie, Ruizhe Pan, Jifeng Zhu, Yong Yang, Kai Bu

Figure 1 for Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks
Figure 2 for Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks
Figure 3 for Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks
Figure 4 for Towards Practical Deployment-Stage Backdoor Attack on Deep Neural Networks
Viaarxiv icon