Picture for Tim Tian Hua

Tim Tian Hua

Steering Evaluation-Aware Language Models To Act Like They Are Deployed

Add code
Oct 23, 2025
Viaarxiv icon