Picture for Zhenqian Xu

Zhenqian Xu

From Data to Behavior: Predicting Unintended Model Behaviors Before Training

Add code
Feb 04, 2026
Viaarxiv icon