Picture for Xianying Luo

Xianying Luo

SafeThinker: Reasoning about Risk to Deepen Safety Beyond Shallow Alignment

Add code
Jan 23, 2026
Viaarxiv icon