Picture for Ilia Shumailov

Ilia Shumailov

A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses

Add code
Jul 02, 2024
Figure 1 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Figure 2 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Figure 3 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Figure 4 for A False Sense of Safety: Unsafe Information Leakage in 'Safe' AI Responses
Viaarxiv icon

UnUnlearning: Unlearning is not sufficient for content regulation in advanced generative AI

Add code
Jun 27, 2024
Viaarxiv icon

Measuring memorization in RLHF for code completion

Add code
Jun 17, 2024
Viaarxiv icon

Beyond Slow Signs in High-fidelity Model Extraction

Add code
Jun 14, 2024
Viaarxiv icon

Locking Machine Learning Models into Hardware

Add code
May 31, 2024
Viaarxiv icon

Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias

Add code
Mar 12, 2024
Figure 1 for Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias
Figure 2 for Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias
Figure 3 for Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias
Figure 4 for Fairness Feedback Loops: Training on Synthetic Data Amplifies Bias
Viaarxiv icon

Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy

Add code
Mar 02, 2024
Figure 1 for Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy
Figure 2 for Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy
Figure 3 for Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy
Figure 4 for Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy
Viaarxiv icon

Architectural Neural Backdoors from First Principles

Add code
Feb 10, 2024
Figure 1 for Architectural Neural Backdoors from First Principles
Figure 2 for Architectural Neural Backdoors from First Principles
Figure 3 for Architectural Neural Backdoors from First Principles
Figure 4 for Architectural Neural Backdoors from First Principles
Viaarxiv icon

Buffer Overflow in Mixture of Experts

Add code
Feb 08, 2024
Viaarxiv icon

Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?

Add code
Oct 21, 2023
Figure 1 for Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
Figure 2 for Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
Figure 3 for Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
Figure 4 for Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
Viaarxiv icon