Picture for Christian Schroeder de Witt

Christian Schroeder de Witt

Michael Pokorny

A Low-Rank Subspace Analysis of LLM Interventions

Add code
Jun 12, 2026
Viaarxiv icon

When Language Representations Interact: Separability and Cross-Lingual Effects in LLMs

Add code
Jun 12, 2026
Viaarxiv icon

A Note on the Strategic Confinement Problem

Add code
Jun 07, 2026
Viaarxiv icon

Architecture Matters for Multi-Agent Security

Add code
Apr 25, 2026
Viaarxiv icon

LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning

Add code
Apr 15, 2026
Viaarxiv icon

Detecting Multi-Agent Collusion Through Multi-Agent Interpretability

Add code
Apr 01, 2026
Viaarxiv icon

A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring

Add code
Feb 26, 2026
Viaarxiv icon

Towards Understanding Multimodal Fine-Tuning: Spatial Features

Add code
Feb 06, 2026
Viaarxiv icon

VET Your Agent: Towards Host-Independent Autonomy via Verifiable Execution Traces

Add code
Dec 17, 2025
Viaarxiv icon

DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection

Add code
Oct 24, 2025
Viaarxiv icon