Saved in:
Bibliographic Details
Main Authors: Bouadi, Mohamed, Alavi, Arta, Benbernou, Salima, Ouziri, Mourad
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.02519
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917793688977408
author Bouadi, Mohamed
Alavi, Arta
Benbernou, Salima
Ouziri, Mourad
author_facet Bouadi, Mohamed
Alavi, Arta
Benbernou, Salima
Ouziri, Mourad
contents The quality of Machine Learning (ML) models strongly depends on the input data, as such generating high-quality features is often required to improve the predictive accuracy. This process is referred to as Feature Engineering (FE). However, since manual feature engineering is time-consuming and requires case-by-case domain knowledge, Automated Feature Engineering (AutoFE) is crucial. A major challenge that remains is to generate interpretable features. To tackle this problem, we introduce SMART, a hybrid approach that uses semantic technologies to guide the generation of interpretable features through a two-step process: Exploitation and Exploration. The former uses Description Logics (DL) to reason on the semantics embedded in Knowledge Graphs (KG) to infer domain-specific features, while the latter exploits the knowledge graph to conduct a guided exploration of the search space through Deep Reinforcement Learning (DRL). Our experiments on public datasets demonstrate that SMART significantly improves prediction accuracy while ensuring a high level of interpretability.
format Preprint
id arxiv_https___arxiv_org_abs_2410_02519
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Semantic-Guided RL for Interpretable Feature Engineering
Bouadi, Mohamed
Alavi, Arta
Benbernou, Salima
Ouziri, Mourad
Machine Learning
The quality of Machine Learning (ML) models strongly depends on the input data, as such generating high-quality features is often required to improve the predictive accuracy. This process is referred to as Feature Engineering (FE). However, since manual feature engineering is time-consuming and requires case-by-case domain knowledge, Automated Feature Engineering (AutoFE) is crucial. A major challenge that remains is to generate interpretable features. To tackle this problem, we introduce SMART, a hybrid approach that uses semantic technologies to guide the generation of interpretable features through a two-step process: Exploitation and Exploration. The former uses Description Logics (DL) to reason on the semantics embedded in Knowledge Graphs (KG) to infer domain-specific features, while the latter exploits the knowledge graph to conduct a guided exploration of the search space through Deep Reinforcement Learning (DRL). Our experiments on public datasets demonstrate that SMART significantly improves prediction accuracy while ensuring a high level of interpretability.
title Semantic-Guided RL for Interpretable Feature Engineering
topic Machine Learning
url https://arxiv.org/abs/2410.02519