Picture for Riya Dulepet

Riya Dulepet

BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems

Add code
May 21, 2025
Viaarxiv icon

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models

Add code
Aug 15, 2024
Viaarxiv icon