Picture for Sam Martin

Sam Martin

How Useful Is Cross-Domain Generalization for Training LLM Monitors?

Add code
May 12, 2026
Viaarxiv icon

Classifier Context Rot: Monitor Performance Degrades with Context Length

Add code
May 12, 2026
Viaarxiv icon

CTRL-ALT-DECEIT: Sabotage Evaluations for Automated AI R&D

Add code
Nov 18, 2025
Viaarxiv icon

A robot-assisted pipeline to rapidly scan 1.7 million historical aerial photographs

Add code
Mar 31, 2025
Viaarxiv icon