Picture for Dongsheng Shi

Dongsheng Shi

Unified Defense for Large Language Models against Jailbreak and Fine-Tuning Attacks in Education

Add code
Nov 18, 2025
Figure 1 for Unified Defense for Large Language Models against Jailbreak and Fine-Tuning Attacks in Education
Figure 2 for Unified Defense for Large Language Models against Jailbreak and Fine-Tuning Attacks in Education
Figure 3 for Unified Defense for Large Language Models against Jailbreak and Fine-Tuning Attacks in Education
Figure 4 for Unified Defense for Large Language Models against Jailbreak and Fine-Tuning Attacks in Education
Viaarxiv icon

Hierarchical Safety Realignment: Lightweight Restoration of Safety in Pruned Large Vision-Language Models

Add code
May 22, 2025
Viaarxiv icon