Abstract: Applications that use Large Language Models (LLMs) are becoming widespread, making the identification of system vulnerabilities increasingly important. Automated Red Teaming accelerates this effort by using an LLM to generate and execute attacks against target systems. Attack generators are evaluated using the Attack Success Rate (ASR), the sample mean of the per-attack success judgments. In this paper, we introduce a method for optimizing attack generator prompts that applies ASR to individual attacks. By repeating each attack multiple times against a randomly seeded target, we measure an attack's discoverability: the expectation of that individual attack's success. This approach reveals exploitable patterns that inform prompt optimization, ultimately enabling more robust evaluation and refinement of generators.
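The two quantities in the abstract can be sketched concretely: discoverability is the mean success of one attack repeated against freshly seeded targets, and ASR is the sample mean over attacks. The sketch below is illustrative only; `run_attack` and the `true_success_prob` field are hypothetical stand-ins for executing an attack and judging its outcome, not part of the paper's system.

```python
import random
from statistics import mean

def run_attack(attack, seed):
    # Hypothetical stand-in for launching one attack against a randomly
    # seeded target and judging success (1) or failure (0).
    rng = random.Random(seed)
    return 1 if rng.random() < attack["true_success_prob"] else 0

def discoverability(attack, n_trials=20):
    # Estimate the expectation of an individual attack's success by
    # repeating the same attack against differently seeded targets.
    return mean(run_attack(attack, seed) for seed in range(n_trials))

def attack_success_rate(attacks, n_trials=20):
    # Conventional ASR: the sample mean of per-attack success, where each
    # attack's success is itself averaged over repeated seeded trials.
    return mean(discoverability(a, n_trials) for a in attacks)
```

Repeating each attack turns a single binary judgment into an estimated probability, which is what exposes the exploitable patterns used for prompt optimization.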
Abstract: We present a hardware-in-the-loop (HIL) simulation setup for repeatable testing of Connected Automated Vehicles (CAVs) in dynamic, real-world scenarios. Our goal is to test control and planning algorithms and their distributed implementation on the vehicle hardware and, possibly, in the cloud. The HIL setup combines PreScan for perception sensors, road topography, and signalized intersections; Vissim for traffic micro-simulation; an ETAS DESK-LABCAR with a dynamometer for vehicle and powertrain dynamics; and on-board electronic control units for real-time CAV control. Models of traffic and signalized intersections are driven by real-world measurements. To demonstrate this HIL simulation setup, we test a Model Predictive Control approach for maximizing the energy efficiency of CAVs in urban environments.
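The Model Predictive Control idea mentioned above can be illustrated with a toy receding-horizon sketch. This is not the paper's controller: the dynamics, cost weights, and the energy proxy (penalizing acceleration magnitude while tracking a reference speed) are all simplifying assumptions for illustration.

```python
from itertools import product

def mpc_step(v0, v_ref, horizon=3, accels=(-1.0, 0.0, 1.0), dt=1.0):
    # Toy receding-horizon controller: enumerate short acceleration
    # sequences, simulate simple integrator dynamics v' = v + a*dt, and
    # score each sequence by an assumed cost trading off an energy proxy
    # (|a|) against deviation from a reference speed. Apply only the first
    # action of the best sequence, as in standard MPC.
    best_cost, best_a0 = float("inf"), 0.0
    for seq in product(accels, repeat=horizon):
        v, cost = v0, 0.0
        for a in seq:
            v += a * dt
            cost += abs(a) + 0.1 * (v - v_ref) ** 2
        if cost < best_cost:
            best_cost, best_a0 = cost, seq[0]
    return best_a0
```

In the HIL setup, a controller of this shape would run on the on-board electronic control units (or in the cloud), with the plant replaced by the DESK-LABCAR/dynamometer vehicle model.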