Picture for Neil Perry

Neil Perry

Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing

Add code
Dec 10, 2025
Viaarxiv icon

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models

Add code
Aug 15, 2024
Viaarxiv icon