Fenghua Weng

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Jul 24, 2025
Via arXiv

DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing

Feb 17, 2025
Via arXiv