Picture for Shengfang Zhai

Shengfang Zhai

Purify Once, Edit Freely: Breaking Image Protections under Model Mismatch

Add code
Mar 13, 2026
Viaarxiv icon

IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation

Add code
Feb 26, 2026
Viaarxiv icon

MemPot: Defending Against Memory Extraction Attack with Optimized Honeypots

Add code
Feb 07, 2026
Viaarxiv icon

Silent Leaks: Implicit Knowledge Extraction Attack on RAG Systems through Benign Queries

Add code
May 21, 2025
Viaarxiv icon

GuardReasoner-VL: Safeguarding VLMs via Reinforced Reasoning

Add code
May 16, 2025
Viaarxiv icon

Sparse Autoencoder as a Zero-Shot Classifier for Concept Erasing in Text-to-Image Diffusion Models

Add code
Mar 12, 2025
Viaarxiv icon

GuardReasoner: Towards Reasoning-based LLM Safeguards

Add code
Jan 30, 2025
Figure 1 for GuardReasoner: Towards Reasoning-based LLM Safeguards
Figure 2 for GuardReasoner: Towards Reasoning-based LLM Safeguards
Figure 3 for GuardReasoner: Towards Reasoning-based LLM Safeguards
Figure 4 for GuardReasoner: Towards Reasoning-based LLM Safeguards
Viaarxiv icon

Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

Add code
May 23, 2024
Figure 1 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Figure 2 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Figure 3 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Figure 4 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Viaarxiv icon

Discovering Universal Semantic Triggers for Text-to-Image Synthesis

Add code
Feb 12, 2024
Figure 1 for Discovering Universal Semantic Triggers for Text-to-Image Synthesis
Figure 2 for Discovering Universal Semantic Triggers for Text-to-Image Synthesis
Figure 3 for Discovering Universal Semantic Triggers for Text-to-Image Synthesis
Figure 4 for Discovering Universal Semantic Triggers for Text-to-Image Synthesis
Viaarxiv icon

TRLS: A Time Series Representation Learning Framework via Spectrogram for Medical Signal Processing

Add code
Jan 06, 2024
Figure 1 for TRLS: A Time Series Representation Learning Framework via Spectrogram for Medical Signal Processing
Figure 2 for TRLS: A Time Series Representation Learning Framework via Spectrogram for Medical Signal Processing
Figure 3 for TRLS: A Time Series Representation Learning Framework via Spectrogram for Medical Signal Processing
Figure 4 for TRLS: A Time Series Representation Learning Framework via Spectrogram for Medical Signal Processing
Viaarxiv icon