Picture for Matt Fredrikson

Matt Fredrikson

Multi-Rollout On-Policy Distillation via Peer Successes and Failures

Add code
May 12, 2026
Viaarxiv icon

How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition

Add code
Mar 16, 2026
Viaarxiv icon

The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems

Add code
Feb 17, 2026
Viaarxiv icon

LipNeXt: Scaling up Lipschitz-based Certified Robustness to Billion-parameter Models

Add code
Jan 26, 2026
Viaarxiv icon

The Trojan in the Vocabulary: Stealthy Sabotage of LLM Composition

Add code
Dec 31, 2025
Viaarxiv icon

Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing

Add code
Dec 10, 2025
Figure 1 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Figure 2 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Figure 3 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Figure 4 for Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing
Viaarxiv icon

Evaluating Language Model Reasoning about Confidential Information

Add code
Aug 27, 2025
Viaarxiv icon

Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition

Add code
Jul 28, 2025
Viaarxiv icon

Transferable Adversarial Attacks on Black-Box Vision-Language Models

Add code
May 02, 2025
Figure 1 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Figure 2 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Figure 3 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Figure 4 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Viaarxiv icon

Is Your Text-to-Image Model Robust to Caption Noise?

Add code
Dec 27, 2024
Viaarxiv icon