Yavuz Bakman

Hair-Trigger Alignment: Black-Box Evaluation Cannot Guarantee Post-Update Alignment

Jan 29, 2026
Via arXiv