Jason Benn

Breaking Barriers: Do Reinforcement Post Training Gains Transfer To Unseen Domains?

Jun 24, 2025
Via arXiv

CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities

Mar 21, 2025
Via arXiv