Picture for David Africa

David Africa

A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring

Add code
Feb 26, 2026
Viaarxiv icon

Alignment Pretraining: AI Discourse Causes Self-Fulfilling (Mis)alignment

Add code
Jan 15, 2026
Viaarxiv icon