Andreas Happe

Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards

Mar 18, 2026

Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design

Apr 14, 2025

Evaluating LLMs for Privilege-Escalation Scenarios

Oct 23, 2023

Getting pwn'd by AI: Penetration Testing with Large Language Models

Aug 17, 2023