Nsfw detection


Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation

Add code
Nov 12, 2025
Figure 1 for Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation
Figure 2 for Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation
Figure 3 for Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation
Figure 4 for Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation
Viaarxiv icon

Wukong Framework for Not Safe For Work Detection in Text-to-Image systems

Add code
Aug 01, 2025
Figure 1 for Wukong Framework for Not Safe For Work Detection in Text-to-Image systems
Figure 2 for Wukong Framework for Not Safe For Work Detection in Text-to-Image systems
Figure 3 for Wukong Framework for Not Safe For Work Detection in Text-to-Image systems
Figure 4 for Wukong Framework for Not Safe For Work Detection in Text-to-Image systems
Viaarxiv icon

VModA: An Effective Framework for Adaptive NSFW Image Moderation

Add code
May 29, 2025
Figure 1 for VModA: An Effective Framework for Adaptive NSFW Image Moderation
Figure 2 for VModA: An Effective Framework for Adaptive NSFW Image Moderation
Figure 3 for VModA: An Effective Framework for Adaptive NSFW Image Moderation
Figure 4 for VModA: An Effective Framework for Adaptive NSFW Image Moderation
Viaarxiv icon

Towards Safe Synthetic Image Generation On the Web: A Multimodal Robust NSFW Defense and Million Scale Dataset

Add code
Apr 16, 2025
Viaarxiv icon

TokenProber: Jailbreaking Text-to-image Models via Fine-grained Word Impact Analysis

Add code
May 11, 2025
Viaarxiv icon

GhostPrompt: Jailbreaking Text-to-image Generative Models based on Dynamic Optimization

Add code
May 25, 2025
Viaarxiv icon

Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics

Add code
Feb 17, 2025
Figure 1 for Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics
Figure 2 for Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics
Figure 3 for Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics
Figure 4 for Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics
Viaarxiv icon

CROPS: Model-Agnostic Training-Free Framework for Safe Image Synthesis with Latent Diffusion Models

Add code
Jan 09, 2025
Viaarxiv icon

AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models

Add code
Dec 24, 2024
Figure 1 for AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models
Figure 2 for AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models
Figure 3 for AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models
Figure 4 for AEIOU: A Unified Defense Framework against NSFW Prompts in Text-to-Image Models
Viaarxiv icon

AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models

Add code
Oct 28, 2024
Figure 1 for AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Figure 2 for AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Figure 3 for AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Figure 4 for AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
Viaarxiv icon