Chloe Li

Model Spec Midtraining: Improving How Alignment Training Generalizes

May 03, 2026
Via arXiv

Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

Nov 14, 2025
Via arXiv