Picture for Samuel Ratnam

Samuel Ratnam

Objective Matters: Fine-Tuning Objectives Shape Safety, Robustness, and Persona Drift

Add code
Jan 19, 2026
Viaarxiv icon

Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment

Add code
Jan 15, 2026
Viaarxiv icon