Saved in:
Bibliographic Details
Main Authors: Woodward, Nathaniel S., Park, Sang Eon, Grosso, Gaia, Krupa, Jeffrey, Harris, Philip
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2412.07033
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915055668297728
author Woodward, Nathaniel S.
Park, Sang Eon
Grosso, Gaia
Krupa, Jeffrey
Harris, Philip
author_facet Woodward, Nathaniel S.
Park, Sang Eon
Grosso, Gaia
Krupa, Jeffrey
Harris, Philip
contents Physical data are representations of the fundamental laws governing the Universe, hiding complex compositional structures often well captured by hierarchical graphs. Hyperbolic spaces are endowed with a non-Euclidean geometry that naturally embeds those structures. To leverage the benefits of non-Euclidean geometries in representing natural data we develop machine learning on $\mathcal P \mathcal M$ spaces, Cartesian products of constant curvature Riemannian manifolds. As a use case we consider the classification of "jets", sprays of hadrons and other subatomic particles produced by the hadronization of quarks and gluons in collider experiments. We compare the performance of $\mathcal P \mathcal M$-MLP and $\mathcal P \mathcal M$-Transformer models across several possible representations. Our experiments show that $\mathcal P \mathcal M$ representations generally perform equal or better to fully Euclidean models of similar size, with the most significant gains found for highly hierarchical jets and small models. We discover significant correlation between the degree of hierarchical structure at a per-jet level and classification performance with the $\mathcal P \mathcal M$-Transformer in top tagging benchmarks. This is a promising result highlighting a potential direction for further improving machine learning model performance through tailoring geometric representation at a per-sample level in hierarchical datasets. These results reinforce the view of geometric representation as a key parameter in maximizing both performance and efficiency of machine learning on natural data.
format Preprint
id arxiv_https___arxiv_org_abs_2412_07033
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Product Manifold Machine Learning for Physics
Woodward, Nathaniel S.
Park, Sang Eon
Grosso, Gaia
Krupa, Jeffrey
Harris, Philip
High Energy Physics - Phenomenology
Physical data are representations of the fundamental laws governing the Universe, hiding complex compositional structures often well captured by hierarchical graphs. Hyperbolic spaces are endowed with a non-Euclidean geometry that naturally embeds those structures. To leverage the benefits of non-Euclidean geometries in representing natural data we develop machine learning on $\mathcal P \mathcal M$ spaces, Cartesian products of constant curvature Riemannian manifolds. As a use case we consider the classification of "jets", sprays of hadrons and other subatomic particles produced by the hadronization of quarks and gluons in collider experiments. We compare the performance of $\mathcal P \mathcal M$-MLP and $\mathcal P \mathcal M$-Transformer models across several possible representations. Our experiments show that $\mathcal P \mathcal M$ representations generally perform equal or better to fully Euclidean models of similar size, with the most significant gains found for highly hierarchical jets and small models. We discover significant correlation between the degree of hierarchical structure at a per-jet level and classification performance with the $\mathcal P \mathcal M$-Transformer in top tagging benchmarks. This is a promising result highlighting a potential direction for further improving machine learning model performance through tailoring geometric representation at a per-sample level in hierarchical datasets. These results reinforce the view of geometric representation as a key parameter in maximizing both performance and efficiency of machine learning on natural data.
title Product Manifold Machine Learning for Physics
topic High Energy Physics - Phenomenology
url https://arxiv.org/abs/2412.07033