Alert button
Picture for Ann-Kathrin Dombrowski

Ann-Kathrin Dombrowski

Alert button

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

Add code
Bookmark button
Alert button
Mar 06, 2024
Nathaniel Li, Alexander Pan, Anjali Gopal, Summer Yue, Daniel Berrios, Alice Gatti, Justin D. Li, Ann-Kathrin Dombrowski, Shashwat Goel, Long Phan, Gabriel Mukobi, Nathan Helm-Burger, Rassin Lababidi, Lennart Justen, Andrew B. Liu, Michael Chen, Isabelle Barrass, Oliver Zhang, Xiaoyuan Zhu, Rishub Tamirisa, Bhrugu Bharathi, Adam Khoja, Zhenqi Zhao, Ariel Herbert-Voss, Cort B. Breuer, Andy Zou, Mantas Mazeika, Zifan Wang, Palash Oswal, Weiran Liu, Adam A. Hunt, Justin Tienken-Harder, Kevin Y. Shih, Kemper Talley, John Guan, Russell Kaplan, Ian Steneker, David Campbell, Brad Jokubaitis, Alex Levinson, Jean Wang, William Qian, Kallol Krishna Karmakar, Steven Basart, Stephen Fitz, Mindy Levine, Ponnurangam Kumaraguru, Uday Tupakula, Vijay Varadharajan, Yan Shoshitaishvili, Jimmy Ba, Kevin M. Esvelt, Alexandr Wang, Dan Hendrycks

Figure 1 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 2 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 3 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Figure 4 for The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Viaarxiv icon

Representation Engineering: A Top-Down Approach to AI Transparency

Add code
Bookmark button
Alert button
Oct 10, 2023
Andy Zou, Long Phan, Sarah Chen, James Campbell, Phillip Guo, Richard Ren, Alexander Pan, Xuwang Yin, Mantas Mazeika, Ann-Kathrin Dombrowski, Shashwat Goel, Nathaniel Li, Michael J. Byun, Zifan Wang, Alex Mallen, Steven Basart, Sanmi Koyejo, Dawn Song, Matt Fredrikson, J. Zico Kolter, Dan Hendrycks

Figure 1 for Representation Engineering: A Top-Down Approach to AI Transparency
Figure 2 for Representation Engineering: A Top-Down Approach to AI Transparency
Figure 3 for Representation Engineering: A Top-Down Approach to AI Transparency
Figure 4 for Representation Engineering: A Top-Down Approach to AI Transparency
Viaarxiv icon

Diffeomorphic Counterfactuals with Generative Models

Add code
Bookmark button
Alert button
Jun 16, 2022
Ann-Kathrin Dombrowski, Jan E. Gerken, Klaus-Robert Müller, Pan Kessel

Figure 1 for Diffeomorphic Counterfactuals with Generative Models
Figure 2 for Diffeomorphic Counterfactuals with Generative Models
Figure 3 for Diffeomorphic Counterfactuals with Generative Models
Figure 4 for Diffeomorphic Counterfactuals with Generative Models
Viaarxiv icon

Automated Dissipation Control for Turbulence Simulation with Shell Models

Add code
Bookmark button
Alert button
Jan 07, 2022
Ann-Kathrin Dombrowski, Klaus-Robert Müller, Wolf Christian Müller

Figure 1 for Automated Dissipation Control for Turbulence Simulation with Shell Models
Figure 2 for Automated Dissipation Control for Turbulence Simulation with Shell Models
Figure 3 for Automated Dissipation Control for Turbulence Simulation with Shell Models
Figure 4 for Automated Dissipation Control for Turbulence Simulation with Shell Models
Viaarxiv icon

Towards Robust Explanations for Deep Neural Networks

Add code
Bookmark button
Alert button
Dec 18, 2020
Ann-Kathrin Dombrowski, Christopher J. Anders, Klaus-Robert Müller, Pan Kessel

Figure 1 for Towards Robust Explanations for Deep Neural Networks
Figure 2 for Towards Robust Explanations for Deep Neural Networks
Figure 3 for Towards Robust Explanations for Deep Neural Networks
Figure 4 for Towards Robust Explanations for Deep Neural Networks
Viaarxiv icon

Fairwashing Explanations with Off-Manifold Detergent

Add code
Bookmark button
Alert button
Jul 20, 2020
Christopher J. Anders, Plamen Pasliev, Ann-Kathrin Dombrowski, Klaus-Robert Müller, Pan Kessel

Figure 1 for Fairwashing Explanations with Off-Manifold Detergent
Figure 2 for Fairwashing Explanations with Off-Manifold Detergent
Figure 3 for Fairwashing Explanations with Off-Manifold Detergent
Figure 4 for Fairwashing Explanations with Off-Manifold Detergent
Viaarxiv icon

Explanations can be manipulated and geometry is to blame

Add code
Bookmark button
Alert button
Jun 19, 2019
Ann-Kathrin Dombrowski, Maximilian Alber, Christopher J. Anders, Marcel Ackermann, Klaus-Robert Müller, Pan Kessel

Figure 1 for Explanations can be manipulated and geometry is to blame
Figure 2 for Explanations can be manipulated and geometry is to blame
Figure 3 for Explanations can be manipulated and geometry is to blame
Figure 4 for Explanations can be manipulated and geometry is to blame
Viaarxiv icon

CNN Cascades for Segmenting Whole Slide Images of the Kidney

Add code
Bookmark button
Alert button
Aug 01, 2017
Michael Gadermayr, Ann-Kathrin Dombrowski, Barbara Mara Klinkhammer, Peter Boor, Dorit Merhof

Figure 1 for CNN Cascades for Segmenting Whole Slide Images of the Kidney
Figure 2 for CNN Cascades for Segmenting Whole Slide Images of the Kidney
Figure 3 for CNN Cascades for Segmenting Whole Slide Images of the Kidney
Figure 4 for CNN Cascades for Segmenting Whole Slide Images of the Kidney
Viaarxiv icon