Nicholas Carlini

Unsolved Problems in ML Safety

Sep 28, 2021
Via arXiv

Deduplicating Training Data Makes Language Models Better

Jul 14, 2021
Via arXiv

Evading Adversarial Example Detection Defenses with Orthogonal Projected Gradient Descent

Jun 28, 2021
Via arXiv

Indicators of Attack Failure: Debugging and Improving Optimization of Adversarial Examples

Jun 18, 2021
Via arXiv

Poisoning and Backdooring Contrastive Learning

Jun 17, 2021
Via arXiv

AdaMatch: A Unified Approach to Semi-Supervised Learning and Domain Adaptation

Jun 08, 2021
Via arXiv

Handcrafted Backdoors in Deep Neural Networks

Jun 08, 2021
Via arXiv

Poisoning the Unlabeled Dataset of Semi-Supervised Learning

May 04, 2021
Via arXiv

Adversary Instantiation: Lower Bounds for Differentially Private Machine Learning

Jan 11, 2021
Via arXiv

Extracting Training Data from Large Language Models

Dec 14, 2020
Via arXiv