Guardado en:
Detalles Bibliográficos
Autores principales: Campi, Marta, Staerman, Guillaume, Peters, Gareth W., Matsui, Tomoko
Formato: Preprint
Publicado: 2024
Materias:
Acceso en línea:https://arxiv.org/abs/2403.04405
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866916629383741440
author Campi, Marta
Staerman, Guillaume
Peters, Gareth W.
Matsui, Tomoko
author_facet Campi, Marta
Staerman, Guillaume
Peters, Gareth W.
Matsui, Tomoko
contents Functional Isolation Forest (FIF) is a recent state-of-the-art Anomaly Detection (AD) algorithm designed for functional data. It relies on a tree partition procedure where an abnormality score is computed by projecting each curve observation on a drawn dictionary through a linear inner product. Such linear inner product and the dictionary are a priori choices that highly influence the algorithm's performances and might lead to unreliable results, particularly with complex datasets. This work addresses these challenges by introducing \textit{Signature Isolation Forest}, a novel AD algorithm class leveraging the rough path theory's signature transform. Our objective is to remove the constraints imposed by FIF through the proposition of two algorithms which specifically target the linearity of the FIF inner product and the choice of the dictionary. We provide several numerical experiments, including a real-world applications benchmark showing the relevance of our methods.
format Preprint
id arxiv_https___arxiv_org_abs_2403_04405
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Signature Isolation Forest
Campi, Marta
Staerman, Guillaume
Peters, Gareth W.
Matsui, Tomoko
Machine Learning
Functional Isolation Forest (FIF) is a recent state-of-the-art Anomaly Detection (AD) algorithm designed for functional data. It relies on a tree partition procedure where an abnormality score is computed by projecting each curve observation on a drawn dictionary through a linear inner product. Such linear inner product and the dictionary are a priori choices that highly influence the algorithm's performances and might lead to unreliable results, particularly with complex datasets. This work addresses these challenges by introducing \textit{Signature Isolation Forest}, a novel AD algorithm class leveraging the rough path theory's signature transform. Our objective is to remove the constraints imposed by FIF through the proposition of two algorithms which specifically target the linearity of the FIF inner product and the choice of the dictionary. We provide several numerical experiments, including a real-world applications benchmark showing the relevance of our methods.
title Signature Isolation Forest
topic Machine Learning
url https://arxiv.org/abs/2403.04405