Guozhu Meng

Align in Depth: Defending Jailbreak Attacks via Progressive Answer Detoxification

Mar 14, 2025
Via arXiv

SAP-DIFF: Semantic Adversarial Patch Generation for Black-Box Face Recognition Models via Diffusion Models

Feb 27, 2025
Via arXiv

Dormant: Defending against Pose-driven Human Image Animation

Sep 22, 2024
Via arXiv

Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction

Feb 28, 2024
Via arXiv

Evaluating Decision Optimality of Autonomous Driving via Metamorphic Testing

Feb 28, 2024
Via arXiv

DataElixir: Purifying Poisoned Dataset to Mitigate Backdoor Attacks via Diffusion Models

Dec 20, 2023
Via arXiv

Good-looking but Lacking Faithfulness: Understanding Local Explanation Methods through Trend-based Testing

Sep 09, 2023
Via arXiv

ConFL: Constraint-guided Fuzzing for Machine Learning Framework

Jul 11, 2023
Via arXiv

SSL-WM: A Black-Box Watermarking Approach for Encoders Pre-trained by Self-supervised Learning

Sep 08, 2022
Via arXiv

Learning Program Semantics with Code Representations: An Empirical Study

Mar 22, 2022
Via arXiv