Picture for Andreas Happe

Andreas Happe

Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design

Add code
Apr 14, 2025
Viaarxiv icon

Evaluating LLMs for Privilege-Escalation Scenarios

Add code
Oct 23, 2023
Viaarxiv icon

Getting pwn'd by AI: Penetration Testing with Large Language Models

Add code
Aug 17, 2023
Viaarxiv icon