Alert button
Picture for Neil Gong

Neil Gong

Alert button

A Transfer Attack to Image Watermarks

Add code
Bookmark button
Alert button
Mar 25, 2024
Yuepeng Hu, Zhengyuan Jiang, Moyang Guo, Neil Gong

Viaarxiv icon

GradSafe: Detecting Unsafe Prompts for LLMs via Safety-Critical Gradient Analysis

Add code
Bookmark button
Alert button
Feb 21, 2024
Yueqi Xie, Minghong Fang, Renjie Pi, Neil Gong

Viaarxiv icon

Mendata: A Framework to Purify Manipulated Training Data

Add code
Bookmark button
Alert button
Dec 03, 2023
Zonghao Huang, Neil Gong, Michael K. Reiter

Viaarxiv icon

SneakyPrompt: Evaluating Robustness of Text-to-image Generative Models' Safety Filters

Add code
Bookmark button
Alert button
May 20, 2023
Yuchen Yang, Bo Hui, Haolin Yuan, Neil Gong, Yinzhi Cao

Figure 1 for SneakyPrompt: Evaluating Robustness of Text-to-image Generative Models' Safety Filters
Figure 2 for SneakyPrompt: Evaluating Robustness of Text-to-image Generative Models' Safety Filters
Figure 3 for SneakyPrompt: Evaluating Robustness of Text-to-image Generative Models' Safety Filters
Figure 4 for SneakyPrompt: Evaluating Robustness of Text-to-image Generative Models' Safety Filters
Viaarxiv icon