Martin Vanek

Latent Introspection: Models Can Detect Prior Concept Injections

Feb 26, 2026
Via arXiv