Title: SDEval: Safety Dynamic Evaluation for Multimodal Large Language Models

Aug 08, 2025

Hanqing Wang, Yuan Tian, Mingyu Liu, Zhenhao Zhang, Xiangyang Zhu

Abstract: In the rapidly evolving landscape of Multimodal Large Language Models (MLLMs), the safety of their outputs has drawn significant attention. Although numerous datasets have been proposed, they may become outdated as MLLMs advance and are susceptible to data contamination. To address these problems, we propose SDEval, the first safety dynamic evaluation framework to controllably adjust the distribution and complexity of safety benchmarks. Specifically, SDEval mainly adopts three dynamic strategies, namely text, image, and text-image dynamics, to generate new samples from original benchmarks. We first explore the individual effects of text and image dynamics on model safety. We then find that injecting text dynamics into images can further impact safety and, conversely, that injecting image dynamics into text also leads to safety risks. SDEval is general enough to be applied to various existing safety benchmarks and even capability benchmarks. Experiments on the safety benchmarks MLLMGuard and VLSBench and the capability benchmarks MMBench and MMVet show that SDEval significantly influences safety evaluation, mitigates data contamination, and exposes safety limitations of MLLMs. Code is available at https://github.com/hq-King/SDEval
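As a rough illustration of what "generating new samples from original benchmarks" could look like for the text side only, here is a minimal Python sketch. It is not taken from the SDEval repository: the `Sample` type, the two toy transforms, and `make_variants` are all hypothetical names, and the perturbations are simple stand-ins chosen for brevity, not the text dynamics the paper actually uses.

```python
import random
from dataclasses import dataclass, replace
from typing import Callable, List

# Hypothetical benchmark record; real benchmarks carry more fields.
@dataclass(frozen=True)
class Sample:
    question: str
    image_path: str

# A text transform takes the original text and a seeded RNG.
TextTransform = Callable[[str, random.Random], str]

def swap_adjacent_chars(text: str, rng: random.Random) -> str:
    """Stand-in perturbation: swap one random pair of adjacent characters."""
    if len(text) < 2:
        return text
    i = rng.randrange(len(text) - 1)
    chars = list(text)
    chars[i], chars[i + 1] = chars[i + 1], chars[i]
    return "".join(chars)

def add_prefix(text: str, rng: random.Random) -> str:
    """Stand-in perturbation: prepend a randomly chosen framing phrase."""
    prefixes = ["For context only:", "Please read carefully:"]
    return f"{rng.choice(prefixes)} {text}"

def make_variants(sample: Sample,
                  transforms: List[TextTransform],
                  seed: int = 0) -> List[Sample]:
    """Derive one new sample per transform; the image is left untouched."""
    rng = random.Random(seed)  # fixed seed keeps the variants reproducible
    return [replace(sample, question=t(sample.question, rng))
            for t in transforms]

original = Sample(question="What is shown in the picture?",
                  image_path="img_001.png")
variants = make_variants(original, [swap_adjacent_chars, add_prefix], seed=42)
for v in variants:
    print(v.question)
```

Because each variant is derived on the fly from a seed, the evaluated text differs from the published benchmark text, which is the general idea behind using dynamic evaluation to reduce the effect of data contamination. Image and text-image dynamics would need an image library and are left out of this sketch.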