Picture for Zheyuan Deng

Zheyuan Deng

Chain of Risk: Safety Failures in Large Reasoning Models and Mitigation via Adaptive Multi-Principle Steering

Add code
May 07, 2026
Viaarxiv icon