Christian Schroeder de Witt

Michael Pokorny

Architecture Matters for Multi-Agent Security

Apr 25, 2026

LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning

Apr 15, 2026

Detecting Multi-Agent Collusion Through Multi-Agent Interpretability

Apr 01, 2026

A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring

Feb 26, 2026

Towards Understanding Multimodal Fine-Tuning: Spatial Features

Feb 06, 2026

VET Your Agent: Towards Host-Independent Autonomy via Verifiable Execution Traces

Dec 17, 2025

DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection

Oct 24, 2025

h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning

Oct 08, 2025

Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations

Sep 10, 2025

Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents

May 04, 2025