Picture for Aleksandr Smechov

Aleksandr Smechov

Opir: Efficient Multi-Task Safety Classification for Toxicity, Jailbreaks, Hate Speech, and Harmful Content

Add code
May 28, 2026
Viaarxiv icon