Picture for Sangwu Park

Sangwu Park

R1-ACT: Efficient Reasoning Model Safety Alignment by Activating Safety Knowledge

Add code
Aug 01, 2025
Viaarxiv icon