Wenzhi Chen

GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors

Jun 09, 2025
Via arXiv

VModA: An Effective Framework for Adaptive NSFW Image Moderation

May 29, 2025
Via arXiv

MentalMAC: Enhancing Large Language Models for Detecting Mental Manipulation via Multi-Task Anti-Curriculum Distillation

May 21, 2025
Via arXiv

DC-SGD: Differentially Private SGD with Dynamic Clipping through Gradient Norm Distribution Estimation

Apr 01, 2025
Via arXiv

Dialogue Injection Attack: Jailbreaking LLMs through Context Manipulation

Mar 11, 2025
Via arXiv

R.R.: Unveiling LLM Training Privacy through Recollection and Ranking

Feb 18, 2025
Via arXiv

Be Cautious When Merging Unfamiliar LLMs: A Phishing Model Capable of Stealing Privacy

Feb 17, 2025
Via arXiv

Could It Be Generated? Towards Practical Analysis of Memorization in Text-To-Image Diffusion Models

May 09, 2024
Via arXiv

How ChatGPT is Solving Vulnerability Management Problem

Nov 11, 2023
Via arXiv

LMSanitator: Defending Prompt-Tuning Against Task-Agnostic Backdoors

Aug 26, 2023
Via arXiv