Picture for Nicolas Papernot

Nicolas Papernot

A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses

Add code
Jul 02, 2024
Figure 1 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Figure 2 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Figure 3 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Figure 4 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Viaarxiv icon

UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI

Add code
Jun 27, 2024
Viaarxiv icon

LLM Dataset Inference: Did you train on my dataset?

Add code
Jun 10, 2024
Viaarxiv icon

Tighter Privacy Auditing of DP-SGD in the Hidden State Threat Model

Add code
May 23, 2024
Viaarxiv icon

Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias

Add code
Mar 12, 2024
Figure 1 for Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias
Figure 2 for Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias
Figure 3 for Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias
Figure 4 for Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias
Viaarxiv icon

Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy

Add code
Mar 02, 2024
Figure 1 for Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy
Figure 2 for Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy
Figure 3 for Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy
Figure 4 for Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy
Viaarxiv icon

Architectural Neural Backdoors from First Principles

Add code
Feb 10, 2024
Figure 1 for Architectural Neural Backdoors from First Principles
Figure 2 for Architectural Neural Backdoors from First Principles
Figure 3 for Architectural Neural Backdoors from First Principles
Figure 4 for Architectural Neural Backdoors from First Principles
Viaarxiv icon

Regulation Games for Trustworthy Machine Learning

Add code
Feb 05, 2024
Viaarxiv icon

Unlearnable Algorithms for In-context Learning

Add code
Feb 01, 2024
Viaarxiv icon

Decentralised, Collaborative, and Privacy-preserving Machine Learning for Multi-Hospital Data

Add code
Jan 31, 2024
Figure 1 for Decentralised, Collaborative, and Privacy-preserving Machine Learning for Multi-Hospital Data
Figure 2 for Decentralised, Collaborative, and Privacy-preserving Machine Learning for Multi-Hospital Data
Figure 3 for Decentralised, Collaborative, and Privacy-preserving Machine Learning for Multi-Hospital Data
Figure 4 for Decentralised, Collaborative, and Privacy-preserving Machine Learning for Multi-Hospital Data
Viaarxiv icon