Alert button
Picture for Hyrum Anderson

Hyrum Anderson

Alert button

Tree of Attacks: Jailbreaking Black-Box LLMs Automatically

Add code
Bookmark button
Alert button
Dec 04, 2023
Anay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum Anderson, Yaron Singer, Amin Karbasi

Viaarxiv icon

Poisoning Web-Scale Training Datasets is Practical

Add code
Bookmark button
Alert button
Feb 20, 2023
Nicholas Carlini, Matthew Jagielski, Christopher A. Choquette-Choo, Daniel Paleka, Will Pearce, Hyrum Anderson, Andreas Terzis, Kurt Thomas, Florian Tramèr

Figure 1 for Poisoning Web-Scale Training Datasets is Practical
Figure 2 for Poisoning Web-Scale Training Datasets is Practical
Figure 3 for Poisoning Web-Scale Training Datasets is Practical
Figure 4 for Poisoning Web-Scale Training Datasets is Practical
Viaarxiv icon

KiloGrams: Very Large N-Grams for Malware Classification

Add code
Bookmark button
Alert button
Aug 01, 2019
Edward Raff, William Fleming, Richard Zak, Hyrum Anderson, Bill Finlayson, Charles Nicholas, Mark McLean

Figure 1 for KiloGrams: Very Large N-Grams for Malware Classification
Figure 2 for KiloGrams: Very Large N-Grams for Malware Classification
Figure 3 for KiloGrams: Very Large N-Grams for Malware Classification
Figure 4 for KiloGrams: Very Large N-Grams for Malware Classification
Viaarxiv icon

The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

Add code
Bookmark button
Alert button
Feb 20, 2018
Miles Brundage, Shahar Avin, Jack Clark, Helen Toner, Peter Eckersley, Ben Garfinkel, Allan Dafoe, Paul Scharre, Thomas Zeitzoff, Bobby Filar, Hyrum Anderson, Heather Roff, Gregory C. Allen, Jacob Steinhardt, Carrick Flynn, Seán Ó hÉigeartaigh, Simon Beard, Haydn Belfield, Sebastian Farquhar, Clare Lyle, Rebecca Crootof, Owain Evans, Michael Page, Joanna Bryson, Roman Yampolskiy, Dario Amodei

Figure 1 for The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation
Viaarxiv icon