Picture for Chengcan Wu

Chengcan Wu

Mitigating Fine-tuning Risks in LLMs via Safety-Aware Probing Optimization

Add code
May 22, 2025
Viaarxiv icon