Weikai Lu

SEA: Low-Resource Safety Alignment for Multimodal Large Language Models via Synthetic Embeddings

Feb 18, 2025

Eraser: Jailbreaking Defense in Large Language Models via Unlearning Harmful Knowledge

Apr 08, 2024