Picture for Nicholas Carlini

Nicholas Carlini

Dj

Query-Based Adversarial Prompt Generation

Add code
Feb 19, 2024
Figure 1 for Query-Based Adversarial Prompt Generation
Figure 2 for Query-Based Adversarial Prompt Generation
Figure 3 for Query-Based Adversarial Prompt Generation
Figure 4 for Query-Based Adversarial Prompt Generation
Viaarxiv icon

Initialization Matters for Adversarial Transfer Learning

Add code
Dec 10, 2023
Figure 1 for Initialization Matters for Adversarial Transfer Learning
Figure 2 for Initialization Matters for Adversarial Transfer Learning
Figure 3 for Initialization Matters for Adversarial Transfer Learning
Figure 4 for Initialization Matters for Adversarial Transfer Learning
Viaarxiv icon

Scalable Extraction of Training Data from (Production) Language Models

Add code
Nov 28, 2023
Figure 1 for Scalable Extraction of Training Data from (Production) Language Models
Figure 2 for Scalable Extraction of Training Data from (Production) Language Models
Figure 3 for Scalable Extraction of Training Data from (Production) Language Models
Figure 4 for Scalable Extraction of Training Data from (Production) Language Models
Viaarxiv icon

Privacy Side Channels in Machine Learning Systems

Add code
Sep 11, 2023
Figure 1 for Privacy Side Channels in Machine Learning Systems
Figure 2 for Privacy Side Channels in Machine Learning Systems
Figure 3 for Privacy Side Channels in Machine Learning Systems
Figure 4 for Privacy Side Channels in Machine Learning Systems
Viaarxiv icon

Reverse-Engineering Decoding Strategies Given Blackbox Access to a Language Generation System

Add code
Sep 09, 2023
Viaarxiv icon

Identifying and Mitigating the Security Risks of Generative AI

Add code
Aug 28, 2023
Figure 1 for Identifying and Mitigating the Security Risks of Generative AI
Viaarxiv icon

A LLM Assisted Exploitation of AI-Guardian

Add code
Jul 20, 2023
Viaarxiv icon

Are aligned neural networks adversarially aligned?

Add code
Jun 26, 2023
Figure 1 for Are aligned neural networks adversarially aligned?
Figure 2 for Are aligned neural networks adversarially aligned?
Figure 3 for Are aligned neural networks adversarially aligned?
Figure 4 for Are aligned neural networks adversarially aligned?
Viaarxiv icon

Evading Black-box Classifiers Without Breaking Eggs

Add code
Jun 05, 2023
Viaarxiv icon

Students Parrot Their Teachers: Membership Inference on Model Distillation

Add code
Mar 06, 2023
Viaarxiv icon