Kat He

How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition

Mar 16, 2026
Via arXiv

LlamaFirewall: An open source guardrail system for building secure AI agents

May 06, 2025
Via arXiv

Adapting World Models with Latent-State Dynamics Residuals

Apr 03, 2025
Via arXiv