Picture for Christian Schroeder de Witt

Christian Schroeder de Witt

Michael Pokorny

DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection

Add code
Oct 24, 2025
Viaarxiv icon

h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning

Add code
Oct 08, 2025
Viaarxiv icon

Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations

Add code
Sep 10, 2025
Figure 1 for Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations
Figure 2 for Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations
Figure 3 for Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations
Viaarxiv icon

Open Challenges in Multi-Agent Security: Towards Secure Systems of Interacting AI Agents

Add code
May 04, 2025
Viaarxiv icon

REAL: Benchmarking Autonomous Agents on Deterministic Simulations of Real Websites

Add code
Apr 15, 2025
Viaarxiv icon

Multi-Agent Security Tax: Trading Off Security and Collaboration Capabilities in Multi-Agent Systems

Add code
Feb 26, 2025
Viaarxiv icon

Fundamental Limitations in Defending LLM Finetuning APIs

Add code
Feb 20, 2025
Figure 1 for Fundamental Limitations in Defending LLM Finetuning APIs
Figure 2 for Fundamental Limitations in Defending LLM Finetuning APIs
Figure 3 for Fundamental Limitations in Defending LLM Finetuning APIs
Figure 4 for Fundamental Limitations in Defending LLM Finetuning APIs
Viaarxiv icon

Multi-Agent Risks from Advanced AI

Add code
Feb 19, 2025
Figure 1 for Multi-Agent Risks from Advanced AI
Figure 2 for Multi-Agent Risks from Advanced AI
Figure 3 for Multi-Agent Risks from Advanced AI
Figure 4 for Multi-Agent Risks from Advanced AI
Viaarxiv icon

PSyDUCK: Training-Free Steganography for Latent Diffusion

Add code
Jan 31, 2025
Figure 1 for PSyDUCK: Training-Free Steganography for Latent Diffusion
Figure 2 for PSyDUCK: Training-Free Steganography for Latent Diffusion
Figure 3 for PSyDUCK: Training-Free Steganography for Latent Diffusion
Figure 4 for PSyDUCK: Training-Free Steganography for Latent Diffusion
Viaarxiv icon

Humanity's Last Exam

Add code
Jan 24, 2025
Viaarxiv icon