Eliot Jones

Security Challenges in AI Agent Deployment: Insights from a Large Scale Public Competition

Jul 28, 2025
Via arXiv

Toxicity of the Commons: Curating Open-Source Pre-Training Data

Oct 29, 2024
Via arXiv

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models

Aug 15, 2024
Via arXiv