Nsfw


JANUS: A Lightweight Framework for Jailbreaking Text-to-Image Models via Distribution Optimization

Add code
Mar 22, 2026
Viaarxiv icon

Localized Concept Erasure in Text-to-Image Diffusion Models via High-Level Representation Misdirection

Add code
Feb 23, 2026
Viaarxiv icon

Differential Vector Erasure: Unified Training-Free Concept Erasure for Flow Matching Models

Add code
Feb 01, 2026
Viaarxiv icon

The Illusion of Forgetting: Attack Unlearned Diffusion via Initial Latent Variable Optimization

Add code
Jan 30, 2026
Viaarxiv icon

SafeRedir: Prompt Embedding Redirection for Robust Unlearning in Image Generation Models

Add code
Jan 13, 2026
Viaarxiv icon

EMMA: Concept Erasure Benchmark with Comprehensive Semantic Metrics and Diverse Categories

Add code
Dec 19, 2025
Viaarxiv icon

SGM: Safety Glasses for Multimodal Large Language Models via Neuron-Level Detoxification

Add code
Dec 17, 2025
Viaarxiv icon

Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation

Add code
Nov 12, 2025
Figure 1 for Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation
Figure 2 for Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation
Figure 3 for Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation
Figure 4 for Value-Aligned Prompt Moderation via Zero-Shot Agentic Rewriting for Safe Image Generation
Viaarxiv icon

Evaluating Machine Translation Datasets for Low-Web Data Languages: A Gendered Lens

Add code
Nov 05, 2025
Viaarxiv icon

Security Risk of Misalignment between Text and Image in Multi-modal Model

Add code
Oct 30, 2025
Viaarxiv icon