Alert button
Picture for Florian Tramèr

Florian Tramèr

Alert button

Privacy Backdoors: Stealing Data with Corrupted Pretrained Models

Add code
Bookmark button
Alert button
Mar 30, 2024
Shanglun Feng, Florian Tramèr

Viaarxiv icon

Query-Based Adversarial Prompt Generation

Add code
Bookmark button
Alert button
Feb 19, 2024
Jonathan Hayase, Ema Borevkovic, Nicholas Carlini, Florian Tramèr, Milad Nasr

Viaarxiv icon

Scalable Extraction of Training Data from (Production) Language Models

Add code
Bookmark button
Alert button
Nov 28, 2023
Milad Nasr, Nicholas Carlini, Jonathan Hayase, Matthew Jagielski, A. Feder Cooper, Daphne Ippolito, Christopher A. Choquette-Choo, Eric Wallace, Florian Tramèr, Katherine Lee

Viaarxiv icon

Universal Jailbreak Backdoors from Poisoned Human Feedback

Add code
Bookmark button
Alert button
Nov 24, 2023
Javier Rando, Florian Tramèr

Viaarxiv icon

Privacy Side Channels in Machine Learning Systems

Add code
Bookmark button
Alert button
Sep 11, 2023
Edoardo Debenedetti, Giorgio Severi, Nicholas Carlini, Christopher A. Choquette-Choo, Matthew Jagielski, Milad Nasr, Eric Wallace, Florian Tramèr

Viaarxiv icon

Evaluating Superhuman Models with Consistency Checks

Add code
Bookmark button
Alert button
Jun 19, 2023
Lukas Fluri, Daniel Paleka, Florian Tramèr

Figure 1 for Evaluating Superhuman Models with Consistency Checks
Figure 2 for Evaluating Superhuman Models with Consistency Checks
Figure 3 for Evaluating Superhuman Models with Consistency Checks
Figure 4 for Evaluating Superhuman Models with Consistency Checks
Viaarxiv icon

Evading Black-box Classifiers Without Breaking Eggs

Add code
Bookmark button
Alert button
Jun 05, 2023
Edoardo Debenedetti, Nicholas Carlini, Florian Tramèr

Figure 1 for Evading Black-box Classifiers Without Breaking Eggs
Figure 2 for Evading Black-box Classifiers Without Breaking Eggs
Figure 3 for Evading Black-box Classifiers Without Breaking Eggs
Figure 4 for Evading Black-box Classifiers Without Breaking Eggs
Viaarxiv icon

Randomness in ML Defenses Helps Persistent Attackers and Hinders Evaluators

Add code
Bookmark button
Alert button
Feb 27, 2023
Keane Lucas, Matthew Jagielski, Florian Tramèr, Lujo Bauer, Nicholas Carlini

Figure 1 for Randomness in ML Defenses Helps Persistent Attackers and Hinders Evaluators
Figure 2 for Randomness in ML Defenses Helps Persistent Attackers and Hinders Evaluators
Figure 3 for Randomness in ML Defenses Helps Persistent Attackers and Hinders Evaluators
Figure 4 for Randomness in ML Defenses Helps Persistent Attackers and Hinders Evaluators
Viaarxiv icon

Poisoning Web-Scale Training Datasets is Practical

Add code
Bookmark button
Alert button
Feb 20, 2023
Nicholas Carlini, Matthew Jagielski, Christopher A. Choquette-Choo, Daniel Paleka, Will Pearce, Hyrum Anderson, Andreas Terzis, Kurt Thomas, Florian Tramèr

Figure 1 for Poisoning Web-Scale Training Datasets is Practical
Figure 2 for Poisoning Web-Scale Training Datasets is Practical
Figure 3 for Poisoning Web-Scale Training Datasets is Practical
Figure 4 for Poisoning Web-Scale Training Datasets is Practical
Viaarxiv icon