Federico Sartore

Boiling the Frog: A Multi-Turn Benchmark for Agentic Safety

May 21, 2026
Via arXiv

Adversarial Humanities Benchmark: Results on Stylistic Robustness in Frontier Model Safety

Apr 20, 2026
Via arXiv

Adversarial Poetry as a Universal Single-Turn Jailbreak Mechanism in Large Language Models

Nov 19, 2025
Via arXiv