Justin D. Li

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

Mar 06, 2024

On Achieving Optimal Adversarial Test Error

Jun 13, 2023

Early-stopped neural networks are consistent

Jun 10, 2021