Picture for Maximilian Dreyer

Maximilian Dreyer

From What to How: Attributing CLIP's Latent Components Reveals Unexpected Semantic Reliance

Add code
May 26, 2025
Viaarxiv icon

Mechanistic understanding and validation of large AI models with SemanticLens

Add code
Jan 09, 2025
Viaarxiv icon

Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers

Add code
Aug 22, 2024
Figure 1 for Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
Figure 2 for Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
Figure 3 for Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
Figure 4 for Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
Viaarxiv icon

Explainable concept mappings of MRI: Revealing the mechanisms underlying deep learning-based brain disease classification

Add code
Apr 16, 2024
Figure 1 for Explainable concept mappings of MRI: Revealing the mechanisms underlying deep learning-based brain disease classification
Figure 2 for Explainable concept mappings of MRI: Revealing the mechanisms underlying deep learning-based brain disease classification
Figure 3 for Explainable concept mappings of MRI: Revealing the mechanisms underlying deep learning-based brain disease classification
Figure 4 for Explainable concept mappings of MRI: Revealing the mechanisms underlying deep learning-based brain disease classification
Viaarxiv icon

Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression

Add code
Apr 15, 2024
Figure 1 for Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression
Figure 2 for Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression
Figure 3 for Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression
Figure 4 for Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression
Viaarxiv icon

PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits

Add code
Apr 09, 2024
Figure 1 for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits
Figure 2 for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits
Figure 3 for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits
Figure 4 for PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits
Viaarxiv icon

AttnLRP: Attention-Aware Layer-wise Relevance Propagation for Transformers

Add code
Feb 08, 2024
Figure 1 for AttnLRP: Attention-Aware Layer-wise Relevance Propagation for Transformers
Figure 2 for AttnLRP: Attention-Aware Layer-wise Relevance Propagation for Transformers
Figure 3 for AttnLRP: Attention-Aware Layer-wise Relevance Propagation for Transformers
Figure 4 for AttnLRP: Attention-Aware Layer-wise Relevance Propagation for Transformers
Viaarxiv icon

Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based Explanations

Add code
Nov 28, 2023
Viaarxiv icon

From Hope to Safety: Unlearning Biases of Deep Models by Enforcing the Right Reasons in Latent Space

Add code
Aug 18, 2023
Viaarxiv icon

Reveal to Revise: An Explainable AI Life Cycle for Iterative Bias Correction of Deep Models

Add code
Mar 27, 2023
Viaarxiv icon