Picture for Garrett Baker

Garrett Baker

Studying Small Language Models with Susceptibilities

Add code
Apr 25, 2025
Viaarxiv icon

Generalization Analogies: A Testbed for Generalizing AI Oversight to Hard-To-Measure Domains

Add code
Nov 19, 2023
Viaarxiv icon