Picture for Bocheng Chen

Bocheng Chen

No Free Lunch for Defending Against Prefilling Attack by In-Context Learning

Add code
Dec 13, 2024
Viaarxiv icon

FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks

Add code
Dec 10, 2024
Figure 1 for FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks
Figure 2 for FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks
Figure 3 for FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks
Figure 4 for FlexLLM: Exploring LLM Customization for Moving Target Defense on Black-Box LLMs Against Jailbreak Attacks
Viaarxiv icon

Protecting Activity Sensing Data Privacy Using Hierarchical Information Dissociation

Add code
Sep 04, 2024
Viaarxiv icon

The Dark Side of Human Feedback: Poisoning Large Language Models via User Inputs

Add code
Sep 01, 2024
Figure 1 for The Dark Side of Human Feedback: Poisoning Large Language Models via User Inputs
Figure 2 for The Dark Side of Human Feedback: Poisoning Large Language Models via User Inputs
Figure 3 for The Dark Side of Human Feedback: Poisoning Large Language Models via User Inputs
Figure 4 for The Dark Side of Human Feedback: Poisoning Large Language Models via User Inputs
Viaarxiv icon

Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems

Add code
Nov 20, 2023
Figure 1 for Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems
Figure 2 for Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems
Figure 3 for Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems
Figure 4 for Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems
Viaarxiv icon

PhantomSound: Black-Box, Query-Efficient Audio Adversarial Attack via Split-Second Phoneme Injection

Add code
Sep 13, 2023
Viaarxiv icon

Understanding Multi-Turn Toxic Behaviors in Open-Domain Chatbots

Add code
Jul 14, 2023
Viaarxiv icon

VSMask: Defending Against Voice Synthesis Attack via Real-Time Predictive Perturbation

Add code
May 09, 2023
Figure 1 for VSMask: Defending Against Voice Synthesis Attack via Real-Time Predictive Perturbation
Figure 2 for VSMask: Defending Against Voice Synthesis Attack via Real-Time Predictive Perturbation
Figure 3 for VSMask: Defending Against Voice Synthesis Attack via Real-Time Predictive Perturbation
Figure 4 for VSMask: Defending Against Voice Synthesis Attack via Real-Time Predictive Perturbation
Viaarxiv icon