Picture for Haichang Gao

Haichang Gao

From Assistant to Double Agent: Formalizing and Benchmarking Attacks on OpenClaw for Personalized Local AI Agent

Add code
Feb 09, 2026
Viaarxiv icon

MLLM Machine Unlearning via Visual Knowledge Distillation

Add code
Dec 12, 2025
Viaarxiv icon

Whispers of Data: Unveiling Label Distributions in Federated Learning Through Virtual Client Simulation

Add code
Apr 30, 2025
Figure 1 for Whispers of Data: Unveiling Label Distributions in Federated Learning Through Virtual Client Simulation
Figure 2 for Whispers of Data: Unveiling Label Distributions in Federated Learning Through Virtual Client Simulation
Figure 3 for Whispers of Data: Unveiling Label Distributions in Federated Learning Through Virtual Client Simulation
Figure 4 for Whispers of Data: Unveiling Label Distributions in Federated Learning Through Virtual Client Simulation
Viaarxiv icon

HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor

Add code
Jan 23, 2025
Figure 1 for HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor
Figure 2 for HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor
Figure 3 for HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor
Figure 4 for HumorReject: Decoupling LLM Safety from Refusal Prefix via A Little Humor
Viaarxiv icon

Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization

Add code
Oct 19, 2024
Figure 1 for Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization
Figure 2 for Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization
Figure 3 for Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization
Figure 4 for Mining Glitch Tokens in Large Language Models via Gradient-based Discrete Optimization
Viaarxiv icon

The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models

Add code
Jul 25, 2024
Figure 1 for The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models
Figure 2 for The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models
Figure 3 for The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models
Figure 4 for The Dark Side of Function Calling: Pathways to Jailbreaking Large Language Models
Viaarxiv icon

SoK: Acoustic Side Channels

Add code
Aug 06, 2023
Figure 1 for SoK: Acoustic Side Channels
Figure 2 for SoK: Acoustic Side Channels
Figure 3 for SoK: Acoustic Side Channels
Figure 4 for SoK: Acoustic Side Channels
Viaarxiv icon

AdvFunMatch: When Consistent Teaching Meets Adversarial Robustness

Add code
May 25, 2023
Figure 1 for AdvFunMatch: When Consistent Teaching Meets Adversarial Robustness
Figure 2 for AdvFunMatch: When Consistent Teaching Meets Adversarial Robustness
Figure 3 for AdvFunMatch: When Consistent Teaching Meets Adversarial Robustness
Figure 4 for AdvFunMatch: When Consistent Teaching Meets Adversarial Robustness
Viaarxiv icon

Lower Difficulty and Better Robustness: A Bregman Divergence Perspective for Adversarial Training

Add code
Aug 26, 2022
Figure 1 for Lower Difficulty and Better Robustness: A Bregman Divergence Perspective for Adversarial Training
Figure 2 for Lower Difficulty and Better Robustness: A Bregman Divergence Perspective for Adversarial Training
Figure 3 for Lower Difficulty and Better Robustness: A Bregman Divergence Perspective for Adversarial Training
Figure 4 for Lower Difficulty and Better Robustness: A Bregman Divergence Perspective for Adversarial Training
Viaarxiv icon

Alleviating Robust Overfitting of Adversarial Training With Consistency Regularization

Add code
May 24, 2022
Figure 1 for Alleviating Robust Overfitting of Adversarial Training With Consistency Regularization
Figure 2 for Alleviating Robust Overfitting of Adversarial Training With Consistency Regularization
Figure 3 for Alleviating Robust Overfitting of Adversarial Training With Consistency Regularization
Figure 4 for Alleviating Robust Overfitting of Adversarial Training With Consistency Regularization
Viaarxiv icon