Saved in:
| Main Authors: | Tinauer, Christian, Damulina, Anna, Sackl, Maximilian, Soellradl, Martin, Achtibat, Reduan, Dreyer, Maximilian, Pahde, Frederik, Lapuschkin, Sebastian, Schmidt, Reinhold, Ropele, Stefan, Samek, Wojciech, Langkammer, Christian |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.10433 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Skull-stripping induces shortcut learning in MRI-based Alzheimer's disease classification
by: Tinauer, Christian, et al.
Published: (2025)
by: Tinauer, Christian, et al.
Published: (2025)
Identifying Alzheimer's Disease Prediction Strategies of Convolutional Neural Network Classifiers using R2* Maps and Spectral Clustering
by: Tinauer, Christian, et al.
Published: (2025)
by: Tinauer, Christian, et al.
Published: (2025)
Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based Explanations
by: Dreyer, Maximilian, et al.
Published: (2023)
by: Dreyer, Maximilian, et al.
Published: (2023)
Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression
by: Bareeva, Dilyara, et al.
Published: (2024)
by: Bareeva, Dilyara, et al.
Published: (2024)
Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
by: Hatefi, Sayed Mohammad Vakilzadeh, et al.
Published: (2024)
by: Hatefi, Sayed Mohammad Vakilzadeh, et al.
Published: (2024)
From Attribution Maps to Human-Understandable Explanations through Concept Relevance Propagation
by: Achtibat, Reduan, et al.
Published: (2022)
by: Achtibat, Reduan, et al.
Published: (2022)
AttnLRP: Attention-Aware Layer-Wise Relevance Propagation for Transformers
by: Achtibat, Reduan, et al.
Published: (2024)
by: Achtibat, Reduan, et al.
Published: (2024)
The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation
by: Kahardipraja, Patrick, et al.
Published: (2025)
by: Kahardipraja, Patrick, et al.
Published: (2025)
Post-Hoc Concept Disentanglement: From Correlated to Isolated Concept Representations
by: Erogullari, Eren, et al.
Published: (2025)
by: Erogullari, Eren, et al.
Published: (2025)
Ensuring Medical AI Safety: Interpretability-Driven Detection and Mitigation of Spurious Model Behavior and Associated Data
by: Pahde, Frederik, et al.
Published: (2025)
by: Pahde, Frederik, et al.
Published: (2025)
Attribution-Guided Pruning for Insight and Control: Circuit Discovery and Targeted Correction in Small-scale LLMs
by: Hatefi, Sayed Mohammad Vakilzadeh, et al.
Published: (2025)
by: Hatefi, Sayed Mohammad Vakilzadeh, et al.
Published: (2025)
Attribution-Guided Decoding
by: Komorowski, Piotr, et al.
Published: (2025)
by: Komorowski, Piotr, et al.
Published: (2025)
Navigating Neural Space: Revisiting Concept Activation Vectors to Overcome Directional Divergence
by: Pahde, Frederik, et al.
Published: (2022)
by: Pahde, Frederik, et al.
Published: (2022)
ECQ$^{\text{x}}$: Explainability-Driven Quantization for Low-Bit and Sparse DNNs
by: Becking, Daniel, et al.
Published: (2021)
by: Becking, Daniel, et al.
Published: (2021)
From What to How: Attributing CLIP's Latent Components Reveals Unexpected Semantic Reliance
by: Dreyer, Maximilian, et al.
Published: (2025)
by: Dreyer, Maximilian, et al.
Published: (2025)
PURE: Turning Polysemantic Neurons Into Pure Features by Identifying Relevant Circuits
by: Dreyer, Maximilian, et al.
Published: (2024)
by: Dreyer, Maximilian, et al.
Published: (2024)
Contrastive Semantic Projection: Faithful Neuron Labeling with Contrastive Examples
by: Bouanani, Oussama, et al.
Published: (2026)
by: Bouanani, Oussama, et al.
Published: (2026)
From Attribution to Action: A Human-Centered Application of Activation Steering
by: Labarta, Tobias, et al.
Published: (2026)
by: Labarta, Tobias, et al.
Published: (2026)
Dyslexify: A Mechanistic Defense Against Typographic Attacks in CLIP
by: Hufe, Lorenz, et al.
Published: (2025)
by: Hufe, Lorenz, et al.
Published: (2025)
Mechanistic understanding and validation of large AI models with SemanticLens
by: Dreyer, Maximilian, et al.
Published: (2025)
by: Dreyer, Maximilian, et al.
Published: (2025)
X-SYS: A Reference Architecture for Interactive Explanation Systems
by: Labarta, Tobias, et al.
Published: (2026)
by: Labarta, Tobias, et al.
Published: (2026)
Iterative Inference in a Chess-Playing Neural Network
by: Sandmann, Elias, et al.
Published: (2025)
by: Sandmann, Elias, et al.
Published: (2025)
Sparse, Efficient and Explainable Data Attribution with DualXDA
by: Yolcu, Galip Ümit, et al.
Published: (2024)
by: Yolcu, Galip Ümit, et al.
Published: (2024)
Explaining Predictive Uncertainty by Exposing Second-Order Effects
by: Bley, Florian, et al.
Published: (2024)
by: Bley, Florian, et al.
Published: (2024)
Atlas-Alignment: Making Interpretability Transferable Across Language Models
by: Puri, Bruno, et al.
Published: (2025)
by: Puri, Bruno, et al.
Published: (2025)
Temporal Dynamics of Parkinson's Disease Tremor: Clinical and Neuroimaging Insights
by: Melanie Suette, et al.
Published: (2026)
by: Melanie Suette, et al.
Published: (2026)
Time‐Varying Brain Functional Reconfiguration Patterns Associated With Fatigue in Multiple Sclerosis
by: Stefanie Hechenberger, et al.
Published: (2026)
by: Stefanie Hechenberger, et al.
Published: (2026)
XAI-guided Insulator Anomaly Detection for Imbalanced Datasets
by: Hoefler, Maximilian Andreas, et al.
Published: (2024)
by: Hoefler, Maximilian Andreas, et al.
Published: (2024)
Human-Centered Evaluation of XAI Methods
by: Dawoud, Karam, et al.
Published: (2023)
by: Dawoud, Karam, et al.
Published: (2023)
A Close Look at Decomposition-based XAI-Methods for Transformer Language Models
by: Arras, Leila, et al.
Published: (2025)
by: Arras, Leila, et al.
Published: (2025)
Relevance-driven Input Dropout: an Explanation-guided Regularization Technique
by: Gururaj, Shreyas, et al.
Published: (2025)
by: Gururaj, Shreyas, et al.
Published: (2025)
Structural Compactness as a Complementary Criterion for Explanation Quality
by: Mesgari, Mohammad Mahdi, et al.
Published: (2026)
by: Mesgari, Mohammad Mahdi, et al.
Published: (2026)
Playing the network backward: A Game Theoretic Attribution Framework
by: Zimmermann, Jakob Paul, et al.
Published: (2026)
by: Zimmermann, Jakob Paul, et al.
Published: (2026)
See What I Mean? CUE: A Cognitive Model of Understanding Explanations
by: Labarta, Tobias, et al.
Published: (2025)
by: Labarta, Tobias, et al.
Published: (2025)
Circuit Insights: Towards Interpretability Beyond Activations
by: Golimblevskaia, Elena, et al.
Published: (2025)
by: Golimblevskaia, Elena, et al.
Published: (2025)
Synthetic Generation of Dermatoscopic Images with GAN and Closed-Form Factorization
by: Mekala, Rohan Reddy, et al.
Published: (2024)
by: Mekala, Rohan Reddy, et al.
Published: (2024)
Building Trust in PINNs: Error Estimation through Finite Difference Methods
by: Krasowski, Aleksander, et al.
Published: (2026)
by: Krasowski, Aleksander, et al.
Published: (2026)
Efficient and Flexible Neural Network Training through Layer-wise Feedback Propagation
by: Weber, Leander, et al.
Published: (2023)
by: Weber, Leander, et al.
Published: (2023)
FADE: Why Bad Descriptions Happen to Good Features
by: Puri, Bruno, et al.
Published: (2025)
by: Puri, Bruno, et al.
Published: (2025)
ReReLRP -- Remembering and Recognizing Tasks with LRP
by: Bogacka, Karolina, et al.
Published: (2025)
by: Bogacka, Karolina, et al.
Published: (2025)
Similar Items
-
Skull-stripping induces shortcut learning in MRI-based Alzheimer's disease classification
by: Tinauer, Christian, et al.
Published: (2025) -
Identifying Alzheimer's Disease Prediction Strategies of Convolutional Neural Network Classifiers using R2* Maps and Spectral Clustering
by: Tinauer, Christian, et al.
Published: (2025) -
Understanding the (Extra-)Ordinary: Validating Deep Model Decisions with Prototypical Concept-based Explanations
by: Dreyer, Maximilian, et al.
Published: (2023) -
Reactive Model Correction: Mitigating Harm to Task-Relevant Features via Conditional Bias Suppression
by: Bareeva, Dilyara, et al.
Published: (2024) -
Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
by: Hatefi, Sayed Mohammad Vakilzadeh, et al.
Published: (2024)