Sana Belguith

The dark deep side of DeepSeek: Fine-tuning attacks against the safety alignment of CoT-enabled models

Feb 03, 2025
Via arXiv