Picture for Ranjie Duan

Ranjie Duan

The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework

Add code
May 25, 2025
Viaarxiv icon

Enhancing Adversarial Robustness of Vision Language Models via Adversarial Mixture Prompt Tuning

Add code
May 23, 2025
Viaarxiv icon

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models

Add code
Apr 25, 2025
Viaarxiv icon

STAIR: Improving Safety Alignment with Introspective Reasoning

Add code
Feb 04, 2025
Figure 1 for STAIR: Improving Safety Alignment with Introspective Reasoning
Figure 2 for STAIR: Improving Safety Alignment with Introspective Reasoning
Figure 3 for STAIR: Improving Safety Alignment with Introspective Reasoning
Figure 4 for STAIR: Improving Safety Alignment with Introspective Reasoning
Viaarxiv icon

Mirage in the Eyes: Hallucination Attack on Multi-modal Large Language Models with Only Attention Sink

Add code
Jan 25, 2025
Viaarxiv icon

Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency

Add code
Jan 09, 2025
Figure 1 for Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
Figure 2 for Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
Figure 3 for Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
Figure 4 for Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency
Viaarxiv icon

MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue

Add code
Nov 06, 2024
Figure 1 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Figure 2 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Figure 3 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Figure 4 for MRJ-Agent: An Effective Jailbreak Agent for Multi-Round Dialogue
Viaarxiv icon

RT-Attack: Jailbreaking Text-to-Image Models via Random Token

Add code
Aug 27, 2024
Viaarxiv icon

Revisiting and Exploring Efficient Fast Adversarial Training via LAW: Lipschitz Regularization and Auto Weight Averaging

Add code
Aug 22, 2023
Viaarxiv icon

Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training

Add code
Jul 24, 2023
Viaarxiv icon