Picture for Andrew Paverd

Andrew Paverd

Microsoft Research

MetaBackdoor: Exploiting Positional Encoding as a Backdoor Attack Surface in LLMs

Add code
May 14, 2026
Viaarxiv icon

Stateless Yet Not Forgetful: Implicit Memory as a Hidden Channel in LLMs

Add code
Feb 09, 2026
Viaarxiv icon

Design Patterns for Securing LLM Agents against Prompt Injections

Add code
Jun 11, 2025
Figure 1 for Design Patterns for Securing LLM Agents against Prompt Injections
Figure 2 for Design Patterns for Securing LLM Agents against Prompt Injections
Figure 3 for Design Patterns for Securing LLM Agents against Prompt Injections
Figure 4 for Design Patterns for Securing LLM Agents against Prompt Injections
Viaarxiv icon

LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge

Add code
Jun 11, 2025
Viaarxiv icon

Securing AI Agents with Information-Flow Control

Add code
May 29, 2025
Viaarxiv icon

ExclaveFL: Providing Transparency to Federated Learning using Exclaves

Add code
Dec 13, 2024
Figure 1 for ExclaveFL: Providing Transparency to Federated Learning using Exclaves
Figure 2 for ExclaveFL: Providing Transparency to Federated Learning using Exclaves
Figure 3 for ExclaveFL: Providing Transparency to Federated Learning using Exclaves
Figure 4 for ExclaveFL: Providing Transparency to Federated Learning using Exclaves
Viaarxiv icon

Permissive Information-Flow Analysis for Large Language Models

Add code
Oct 04, 2024
Figure 1 for Permissive Information-Flow Analysis for Large Language Models
Figure 2 for Permissive Information-Flow Analysis for Large Language Models
Figure 3 for Permissive Information-Flow Analysis for Large Language Models
Figure 4 for Permissive Information-Flow Analysis for Large Language Models
Viaarxiv icon

Are you still on track!? Catching LLM Task Drift with Activations

Add code
Jun 02, 2024
Figure 1 for Are you still on track!? Catching LLM Task Drift with Activations
Figure 2 for Are you still on track!? Catching LLM Task Drift with Activations
Figure 3 for Are you still on track!? Catching LLM Task Drift with Activations
Figure 4 for Are you still on track!? Catching LLM Task Drift with Activations
Viaarxiv icon

Closed-Form Bounds for DP-SGD against Record-level Inference

Add code
Feb 22, 2024
Figure 1 for Closed-Form Bounds for DP-SGD against Record-level Inference
Figure 2 for Closed-Form Bounds for DP-SGD against Record-level Inference
Figure 3 for Closed-Form Bounds for DP-SGD against Record-level Inference
Figure 4 for Closed-Form Bounds for DP-SGD against Record-level Inference
Viaarxiv icon

Maatphor: Automated Variant Analysis for Prompt Injection Attacks

Add code
Dec 12, 2023
Viaarxiv icon