Chaowei Xiao

AI Risk Management Should Incorporate Both Safety and Security

May 29, 2024

Visual-RolePlay: Universal Jailbreak Attack on MultiModal Large Language Models via Role-playing Image Character

May 25, 2024

Safeguarding Vision-Language Models Against Patched Visual Prompt Injectors

May 17, 2024

JailBreakV-28K: A Benchmark for Assessing the Robustness of MultiModal Large Language Models against Jailbreak Attacks

Apr 03, 2024

Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models

Mar 26, 2024

AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting

Mar 14, 2024

Automatic and Universal Prompt Injection Attacks against Large Language Models

Mar 07, 2024

A New Era in LLM Security: Exploring Security Concerns in Real-World LLM-based Systems

Feb 28, 2024

Mitigating Fine-tuning Jailbreak Attack with Backdoor Enhanced Alignment

Feb 27, 2024

WIPI: A New Web Threat for LLM-Driven Web Agents

Feb 26, 2024