Picture for Peixin Zhang

Peixin Zhang

DDOR: Delta Debugging for Explainable Overrefusal Testing and Repair

Add code
Jun 02, 2026
Viaarxiv icon

ClawGuard: A Runtime Security Framework for Tool-Augmented LLM Agents Against Indirect Prompt Injection

Add code
Apr 13, 2026
Viaarxiv icon

Towards Provably Unlearnable Examples via Bayes Error Optimisation

Add code
Nov 11, 2025
Figure 1 for Towards Provably Unlearnable Examples via Bayes Error Optimisation
Figure 2 for Towards Provably Unlearnable Examples via Bayes Error Optimisation
Figure 3 for Towards Provably Unlearnable Examples via Bayes Error Optimisation
Figure 4 for Towards Provably Unlearnable Examples via Bayes Error Optimisation
Viaarxiv icon

PRUNE: A Patching Based Repair Framework for Certiffable Unlearning of Neural Networks

Add code
May 10, 2025
Viaarxiv icon

RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively

Add code
Nov 15, 2024
Figure 1 for RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively
Figure 2 for RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively
Figure 3 for RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively
Figure 4 for RedTest: Towards Measuring Redundancy in Deep Neural Networks Effectively
Viaarxiv icon

LLMScan: Causal Scan for LLM Misbehavior Detection

Add code
Oct 23, 2024
Figure 1 for LLMScan: Causal Scan for LLM Misbehavior Detection
Figure 2 for LLMScan: Causal Scan for LLM Misbehavior Detection
Figure 3 for LLMScan: Causal Scan for LLM Misbehavior Detection
Figure 4 for LLMScan: Causal Scan for LLM Misbehavior Detection
Viaarxiv icon

Towards Certified Probabilistic Robustness with High Accuracy

Add code
Sep 02, 2023
Figure 1 for Towards Certified Probabilistic Robustness with High Accuracy
Figure 2 for Towards Certified Probabilistic Robustness with High Accuracy
Figure 3 for Towards Certified Probabilistic Robustness with High Accuracy
Figure 4 for Towards Certified Probabilistic Robustness with High Accuracy
Viaarxiv icon

Fairness Testing of Deep Image Classification with Adequacy Metrics

Add code
Dec 01, 2021
Figure 1 for Fairness Testing of Deep Image Classification with Adequacy Metrics
Figure 2 for Fairness Testing of Deep Image Classification with Adequacy Metrics
Figure 3 for Fairness Testing of Deep Image Classification with Adequacy Metrics
Figure 4 for Fairness Testing of Deep Image Classification with Adequacy Metrics
Viaarxiv icon

Automatic Fairness Testing of Neural Classifiers through Adversarial Sampling

Add code
Jul 29, 2021
Figure 1 for Automatic Fairness Testing of Neural Classifiers through Adversarial Sampling
Figure 2 for Automatic Fairness Testing of Neural Classifiers through Adversarial Sampling
Figure 3 for Automatic Fairness Testing of Neural Classifiers through Adversarial Sampling
Figure 4 for Automatic Fairness Testing of Neural Classifiers through Adversarial Sampling
Viaarxiv icon

There is Limited Correlation between Coverage and Robustness for Deep Neural Networks

Add code
Nov 14, 2019
Figure 1 for There is Limited Correlation between Coverage and Robustness for Deep Neural Networks
Figure 2 for There is Limited Correlation between Coverage and Robustness for Deep Neural Networks
Figure 3 for There is Limited Correlation between Coverage and Robustness for Deep Neural Networks
Figure 4 for There is Limited Correlation between Coverage and Robustness for Deep Neural Networks
Viaarxiv icon