Picture for Xuankun Rong

Xuankun Rong

SafeGRPO: Self-Rewarded Multimodal Safety Alignment via Rule-Governed Policy Optimization

Add code
Nov 17, 2025
Viaarxiv icon

Backdoor Cleaning without External Guidance in MLLM Fine-tuning

Add code
May 22, 2025
Viaarxiv icon

Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model

Add code
Mar 06, 2025
Viaarxiv icon