Arush Tagade

The SaTML '24 CNN Interpretability Competition: New Innovations for Concept-Level Interpretability

Apr 03, 2024
Stephen Casper, Jieun Yun, Joonhyuk Baek, Yeseong Jung, Minhwan Kim, Kiwan Kwon, Saerom Park, Hayden Moore, David Shriver, Marissa Connor, Keltin Grimes, Angus Nicolson, Arush Tagade, Jessica Rumbelow, Hieu Minh Nguyen, Dylan Hadfield-Menell

Via arXiv

Scalable and Transferable Black-Box Jailbreaks for Language Models via Persona Modulation

Nov 06, 2023
Rusheb Shah, Quentin Feuillade--Montixi, Soroush Pour, Arush Tagade, Stephen Casper, Javier Rando

Via arXiv

Prototype Generation: Robust Feature Visualisation for Data Independent Interpretability

Sep 29, 2023
Arush Tagade, Jessica Rumbelow

Via arXiv

Why do CNNs excel at feature extraction? A mathematical explanation

Jul 03, 2023
Vinoth Nandakumar, Arush Tagade, Tongliang Liu

Via arXiv