Alert button
Picture for Xinyue Shen

Xinyue Shen

Alert button

Comprehensive Assessment of Jailbreak Attacks Against LLMs

Add code
Bookmark button
Alert button
Feb 08, 2024
Junjie Chu, Yugeng Liu, Ziqing Yang, Xinyue Shen, Michael Backes, Yang Zhang

Viaarxiv icon

"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models

Add code
Bookmark button
Alert button
Aug 07, 2023
Xinyue Shen, Zeyuan Chen, Michael Backes, Yun Shen, Yang Zhang

Figure 1 for "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
Figure 2 for "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
Figure 3 for "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
Figure 4 for "Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models
Viaarxiv icon

Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models

Add code
Bookmark button
Alert button
May 23, 2023
Yiting Qu, Xinyue Shen, Xinlei He, Michael Backes, Savvas Zannettou, Yang Zhang

Figure 1 for Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Figure 2 for Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Figure 3 for Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Figure 4 for Unsafe Diffusion: On the Generation of Unsafe Images and Hateful Memes From Text-To-Image Models
Viaarxiv icon

In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT

Add code
Bookmark button
Alert button
Apr 18, 2023
Xinyue Shen, Zeyuan Chen, Michael Backes, Yang Zhang

Figure 1 for In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
Figure 2 for In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
Figure 3 for In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
Figure 4 for In ChatGPT We Trust? Measuring and Characterizing the Reliability of ChatGPT
Viaarxiv icon

MGTBench: Benchmarking Machine-Generated Text Detection

Add code
Bookmark button
Alert button
Mar 26, 2023
Xinlei He, Xinyue Shen, Zeyuan Chen, Michael Backes, Yang Zhang

Figure 1 for MGTBench: Benchmarking Machine-Generated Text Detection
Figure 2 for MGTBench: Benchmarking Machine-Generated Text Detection
Figure 3 for MGTBench: Benchmarking Machine-Generated Text Detection
Figure 4 for MGTBench: Benchmarking Machine-Generated Text Detection
Viaarxiv icon

Prompt Stealing Attacks Against Text-to-Image Generation Models

Add code
Bookmark button
Alert button
Feb 20, 2023
Xinyue Shen, Yiting Qu, Michael Backes, Yang Zhang

Figure 1 for Prompt Stealing Attacks Against Text-to-Image Generation Models
Figure 2 for Prompt Stealing Attacks Against Text-to-Image Generation Models
Figure 3 for Prompt Stealing Attacks Against Text-to-Image Generation Models
Figure 4 for Prompt Stealing Attacks Against Text-to-Image Generation Models
Viaarxiv icon

Backdoor Attacks in the Supply Chain of Masked Image Modeling

Add code
Bookmark button
Alert button
Oct 04, 2022
Xinyue Shen, Xinlei He, Zheng Li, Yun Shen, Michael Backes, Yang Zhang

Figure 1 for Backdoor Attacks in the Supply Chain of Masked Image Modeling
Figure 2 for Backdoor Attacks in the Supply Chain of Masked Image Modeling
Figure 3 for Backdoor Attacks in the Supply Chain of Masked Image Modeling
Figure 4 for Backdoor Attacks in the Supply Chain of Masked Image Modeling
Viaarxiv icon

Nonconvex Sparse Logistic Regression with Weakly Convex Regularization

Add code
Bookmark button
Alert button
Aug 07, 2017
Xinyue Shen, Yuantao Gu

Figure 1 for Nonconvex Sparse Logistic Regression with Weakly Convex Regularization
Figure 2 for Nonconvex Sparse Logistic Regression with Weakly Convex Regularization
Figure 3 for Nonconvex Sparse Logistic Regression with Weakly Convex Regularization
Figure 4 for Nonconvex Sparse Logistic Regression with Weakly Convex Regularization
Viaarxiv icon