Alert button
Picture for Aleksandar Makelov

Aleksandar Makelov

Alert button

Is This the Subspace You Are Looking for? An Interpretability Illusion for Subspace Activation Patching

Add code
Bookmark button
Alert button
Dec 06, 2023
Aleksandar Makelov, Georg Lange, Neel Nanda

Viaarxiv icon

Rethinking Backdoor Attacks

Add code
Bookmark button
Alert button
Jul 19, 2023
Alaa Khaddaj, Guillaume Leclerc, Aleksandar Makelov, Kristian Georgiev, Hadi Salman, Andrew Ilyas, Aleksander Madry

Figure 1 for Rethinking Backdoor Attacks
Figure 2 for Rethinking Backdoor Attacks
Figure 3 for Rethinking Backdoor Attacks
Figure 4 for Rethinking Backdoor Attacks
Viaarxiv icon

Towards Deep Learning Models Resistant to Adversarial Attacks

Add code
Bookmark button
Alert button
Nov 09, 2017
Aleksander Madry, Aleksandar Makelov, Ludwig Schmidt, Dimitris Tsipras, Adrian Vladu

Figure 1 for Towards Deep Learning Models Resistant to Adversarial Attacks
Figure 2 for Towards Deep Learning Models Resistant to Adversarial Attacks
Figure 3 for Towards Deep Learning Models Resistant to Adversarial Attacks
Figure 4 for Towards Deep Learning Models Resistant to Adversarial Attacks
Viaarxiv icon