Saved in:
Bibliographic Details
Main Authors: Overman, Tom, Klabjan, Diego, Utke, Jean
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2409.04665
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917938128224256
author Overman, Tom
Klabjan, Diego
Utke, Jean
author_facet Overman, Tom
Klabjan, Diego
Utke, Jean
contents Automated feature engineering (AutoFE) is the process of automatically building and selecting new features that help improve downstream predictive performance. While traditional feature engineering requires significant domain expertise and time-consuming iterative testing, AutoFE strives to make feature engineering easy and accessible to all data science practitioners. We introduce a new AutoFE algorithm, IIFE, based on determining which feature pairs synergize well through an information-theoretic perspective called interaction information. We demonstrate the superior performance of IIFE over existing algorithms. We also show how interaction information can be used to improve existing AutoFE algorithms. Finally, we highlight several critical experimental setup issues in the existing AutoFE literature and their effects on performance.
format Preprint
id arxiv_https___arxiv_org_abs_2409_04665
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle IIFE: Interaction Information Based Automated Feature Engineering
Overman, Tom
Klabjan, Diego
Utke, Jean
Machine Learning
Automated feature engineering (AutoFE) is the process of automatically building and selecting new features that help improve downstream predictive performance. While traditional feature engineering requires significant domain expertise and time-consuming iterative testing, AutoFE strives to make feature engineering easy and accessible to all data science practitioners. We introduce a new AutoFE algorithm, IIFE, based on determining which feature pairs synergize well through an information-theoretic perspective called interaction information. We demonstrate the superior performance of IIFE over existing algorithms. We also show how interaction information can be used to improve existing AutoFE algorithms. Finally, we highlight several critical experimental setup issues in the existing AutoFE literature and their effects on performance.
title IIFE: Interaction Information Based Automated Feature Engineering
topic Machine Learning
url https://arxiv.org/abs/2409.04665