Picture for Yinpeng Dong

Yinpeng Dong

Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction

Add code
Feb 28, 2024
Figure 1 for Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction
Figure 2 for Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction
Figure 3 for Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction
Figure 4 for Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction
Viaarxiv icon

BSPA: Exploring Black-box Stealthy Prompt Attacks against Image Generators

Add code
Feb 23, 2024
Viaarxiv icon

Your Diffusion Model is Secretly a Certifiably Robust Classifier

Add code
Feb 13, 2024
Figure 1 for Your Diffusion Model is Secretly a Certifiably Robust Classifier
Figure 2 for Your Diffusion Model is Secretly a Certifiably Robust Classifier
Figure 3 for Your Diffusion Model is Secretly a Certifiably Robust Classifier
Figure 4 for Your Diffusion Model is Secretly a Certifiably Robust Classifier
Viaarxiv icon

Discovering Universal Semantic Triggers for Text-to-Image Synthesis

Add code
Feb 12, 2024
Figure 1 for Discovering Universal Semantic Triggers for Text-to-Image Synthesis
Figure 2 for Discovering Universal Semantic Triggers for Text-to-Image Synthesis
Figure 3 for Discovering Universal Semantic Triggers for Text-to-Image Synthesis
Figure 4 for Discovering Universal Semantic Triggers for Text-to-Image Synthesis
Viaarxiv icon

Towards Transferable Targeted 3D Adversarial Attack in the Physical World

Add code
Dec 15, 2023
Figure 1 for Towards Transferable Targeted 3D Adversarial Attack in the Physical World
Figure 2 for Towards Transferable Targeted 3D Adversarial Attack in the Physical World
Figure 3 for Towards Transferable Targeted 3D Adversarial Attack in the Physical World
Figure 4 for Towards Transferable Targeted 3D Adversarial Attack in the Physical World
Viaarxiv icon

Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training

Add code
Dec 12, 2023
Figure 1 for Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training
Figure 2 for Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training
Figure 3 for Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training
Figure 4 for Focus on Hiders: Exploring Hidden Threats for Enhancing Adversarial Training
Viaarxiv icon

Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning

Add code
Dec 05, 2023
Figure 1 for Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning
Figure 2 for Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning
Figure 3 for Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning
Figure 4 for Machine Vision Therapy: Multimodal Large Language Models Can Enhance Visual Robustness via Denoising In-Context Learning
Viaarxiv icon

Evil Geniuses: Delving into the Safety of LLM-based Agents

Add code
Nov 20, 2023
Figure 1 for Evil Geniuses: Delving into the Safety of LLM-based Agents
Figure 2 for Evil Geniuses: Delving into the Safety of LLM-based Agents
Figure 3 for Evil Geniuses: Delving into the Safety of LLM-based Agents
Figure 4 for Evil Geniuses: Delving into the Safety of LLM-based Agents
Viaarxiv icon

How Robust is Google's Bard to Adversarial Image Attacks?

Add code
Sep 21, 2023
Figure 1 for How Robust is Google's Bard to Adversarial Image Attacks?
Figure 2 for How Robust is Google's Bard to Adversarial Image Attacks?
Figure 3 for How Robust is Google's Bard to Adversarial Image Attacks?
Figure 4 for How Robust is Google's Bard to Adversarial Image Attacks?
Viaarxiv icon

Robustness and Generalizability of Deepfake Detection: A Study with Diffusion Models

Add code
Sep 05, 2023
Viaarxiv icon