Picture for Xudong Pan

Xudong Pan

AgentCyberRange: Benchmarking Frontier AI Systems in Realistic Cyber Ranges

Add code
Jun 12, 2026
Viaarxiv icon

The Emergence of Autonomous Penetration Capabilities in Large Language Model-Powered AI Systems

Add code
Jun 11, 2026
Viaarxiv icon

CyberEvolver: Structured Self-Evolution for Cybersecurity Agents On the Fly

Add code
May 25, 2026
Viaarxiv icon

FlowGuard: Towards Lightweight In-Generation Safety Detection for Diffusion Models via Linear Latent Decoding

Add code
Apr 09, 2026
Viaarxiv icon

Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search

Add code
Mar 25, 2026
Viaarxiv icon

MirrorGuard: Toward Secure Computer-Use Agents via Simulation-to-Real Reasoning Correction

Add code
Jan 19, 2026
Viaarxiv icon

WebTrap Park: An Automated Platform for Systematic Security Evaluation of Web Agents

Add code
Jan 13, 2026
Viaarxiv icon

When Bots Take the Bait: Exposing and Mitigating the Emerging Social Engineering Attack in Web Automation Agent

Add code
Jan 12, 2026
Viaarxiv icon

Evaluation Faking: Unveiling Observer Effects in Safety Evaluation of Frontier AI Systems

Add code
May 23, 2025
Viaarxiv icon

ReasoningShield: Content Safety Detection over Reasoning Traces of Large Reasoning Models

Add code
May 22, 2025
Figure 1 for ReasoningShield: Content Safety Detection over Reasoning Traces of Large Reasoning Models
Figure 2 for ReasoningShield: Content Safety Detection over Reasoning Traces of Large Reasoning Models
Figure 3 for ReasoningShield: Content Safety Detection over Reasoning Traces of Large Reasoning Models
Figure 4 for ReasoningShield: Content Safety Detection over Reasoning Traces of Large Reasoning Models
Viaarxiv icon