Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Mihelčić, Matej, Grubišić, Ivan, Keber, Miha
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2501.01209
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916548259610624
author	Mihelčić, Matej Grubišić, Ivan Keber, Miha
author_facet	Mihelčić, Matej Grubišić, Ivan Keber, Miha
contents	Deep learning models (DLMs) achieve increasingly high performance both on structured and unstructured data. They significantly extended applicability of machine learning to various domains. Their success in making predictions, detecting patterns and generating new data made significant impact on science and industry. Despite these accomplishments, DLMs are difficult to explain because of their enormous size. In this work, we propose a novel framework for post-hoc explaining and relating DLMs using redescriptions. The framework allows cohort analysis of arbitrary DLMs by identifying statistically significant redescriptions of neuron activations. It allows coupling neurons to a set of target labels or sets of descriptive attributes, relating layers within a single DLM or associating different DLMs. The proposed framework is independent of the artificial neural network architecture and can work with more complex target labels (e.g. multi-label or multi-target scenario). Additionally, it can emulate both pedagogical and decompositional approach to rule extraction. The aforementioned properties of the proposed framework can increase explainability and interpretability of arbitrary DLMs by providing different information compared to existing explainable-AI approaches.
format	Preprint
id	arxiv_https___arxiv_org_abs_2501_01209
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	A redescription mining framework for post-hoc explaining and relating deep learning models Mihelčić, Matej Grubišić, Ivan Keber, Miha Artificial Intelligence Machine Learning Deep learning models (DLMs) achieve increasingly high performance both on structured and unstructured data. They significantly extended applicability of machine learning to various domains. Their success in making predictions, detecting patterns and generating new data made significant impact on science and industry. Despite these accomplishments, DLMs are difficult to explain because of their enormous size. In this work, we propose a novel framework for post-hoc explaining and relating DLMs using redescriptions. The framework allows cohort analysis of arbitrary DLMs by identifying statistically significant redescriptions of neuron activations. It allows coupling neurons to a set of target labels or sets of descriptive attributes, relating layers within a single DLM or associating different DLMs. The proposed framework is independent of the artificial neural network architecture and can work with more complex target labels (e.g. multi-label or multi-target scenario). Additionally, it can emulate both pedagogical and decompositional approach to rule extraction. The aforementioned properties of the proposed framework can increase explainability and interpretability of arbitrary DLMs by providing different information compared to existing explainable-AI approaches.
title	A redescription mining framework for post-hoc explaining and relating deep learning models
topic	Artificial Intelligence Machine Learning
url	https://arxiv.org/abs/2501.01209

Similar Items