Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Il-chul Moon

Prompt-Based Safety Guidance Is Ineffective for Unlearned Text-to-Image Diffusion Models

Nov 06, 2025

Jiwoo Shin, Byeonghu Na, Mina Kang, Wonhyeok Choi, Il-chul Moon

Abstract:Recent advances in text-to-image generative models have raised concerns about their potential to produce harmful content when provided with malicious input text prompts. To address this issue, two main approaches have emerged: (1) fine-tuning the model to unlearn harmful concepts and (2) training-free guidance methods that leverage negative prompts. However, we observe that combining these two orthogonal approaches often leads to marginal or even degraded defense performance. This observation indicates a critical incompatibility between two paradigms, which hinders their combined effectiveness. In this work, we address this issue by proposing a conceptually simple yet experimentally robust method: replacing the negative prompts used in training-free methods with implicit negative embeddings obtained through concept inversion. Our method requires no modification to either approach and can be easily integrated into existing pipelines. We experimentally validate its effectiveness on nudity and violence benchmarks, demonstrating consistent improvements in defense success rate while preserving the core semantics of input prompts.

* Accepted at NeurIPS 2025 Workshop on Generative and Protective AI for Content Creation

Via

Access Paper or Ask Questions

Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning

Mar 07, 2024

Hyungho Na, Yunkyeong Seo, Il-chul Moon

Figure 1 for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning

Figure 2 for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning

Figure 3 for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning

Figure 4 for Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning

Abstract:In cooperative multi-agent reinforcement learning (MARL), agents aim to achieve a common goal, such as defeating enemies or scoring a goal. Existing MARL algorithms are effective but still require significant learning time and often get trapped in local optima by complex tasks, subsequently failing to discover a goal-reaching policy. To address this, we introduce Efficient episodic Memory Utilization (EMU) for MARL, with two primary objectives: (a) accelerating reinforcement learning by leveraging semantically coherent memory from an episodic buffer and (b) selectively promoting desirable transitions to prevent local convergence. To achieve (a), EMU incorporates a trainable encoder/decoder structure alongside MARL, creating coherent memory embeddings that facilitate exploratory memory recall. To achieve (b), EMU introduces a novel reward structure called episodic incentive based on the desirability of states. This reward improves the TD target in Q-learning and acts as an additional incentive for desirable transitions. We provide theoretical support for the proposed incentive and demonstrate the effectiveness of EMU compared to conventional episodic control. The proposed method is evaluated in StarCraft II and Google Research Football, and empirical results indicate further performance improvement over state-of-the-art methods.

* Accepted at ICLR 2024

Via

Access Paper or Ask Questions