MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Harris, Philip, Kagan, Michael, Krupa, Jeffrey, Maier, Benedikt, Woodward, Nathaniel
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	High Energy Physics - Phenomenology Machine Learning High Energy Physics - Experiment
Accesso online:	https://arxiv.org/abs/2403.07066
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866910842932428800
author	Harris, Philip Kagan, Michael Krupa, Jeffrey Maier, Benedikt Woodward, Nathaniel
author_facet	Harris, Philip Kagan, Michael Krupa, Jeffrey Maier, Benedikt Woodward, Nathaniel
contents	Self-Supervised Learning (SSL) is at the core of training modern large machine learning models, providing a scheme for learning powerful representations that can be used in a variety of downstream tasks. However, SSL strategies must be adapted to the type of training data and downstream tasks required. We propose RS3L ("Re-simulation-based self-supervised representation learning"), a novel simulation-based SSL strategy that employs a method of re-simulation to drive data augmentation for contrastive learning in the physical sciences, particularly, in fields that rely on stochastic simulators. By intervening in the middle of the simulation process and re-running simulation components downstream of the intervention, we generate multiple realizations of an event, thus producing a set of augmentations covering all physics-driven variations available in the simulator. Using experiments from high-energy physics, we explore how this strategy may enable the development of a foundation model; we show how RS3L pre-training enables powerful performance in downstream tasks such as discrimination of a variety of objects and uncertainty mitigation. In addition to our results, we make the RS3L dataset publicly available for further studies on how to improve SSL strategies.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_07066
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Re-Simulation-based Self-Supervised Learning for Pre-Training Foundation Models Harris, Philip Kagan, Michael Krupa, Jeffrey Maier, Benedikt Woodward, Nathaniel High Energy Physics - Phenomenology Machine Learning High Energy Physics - Experiment Self-Supervised Learning (SSL) is at the core of training modern large machine learning models, providing a scheme for learning powerful representations that can be used in a variety of downstream tasks. However, SSL strategies must be adapted to the type of training data and downstream tasks required. We propose RS3L ("Re-simulation-based self-supervised representation learning"), a novel simulation-based SSL strategy that employs a method of re-simulation to drive data augmentation for contrastive learning in the physical sciences, particularly, in fields that rely on stochastic simulators. By intervening in the middle of the simulation process and re-running simulation components downstream of the intervention, we generate multiple realizations of an event, thus producing a set of augmentations covering all physics-driven variations available in the simulator. Using experiments from high-energy physics, we explore how this strategy may enable the development of a foundation model; we show how RS3L pre-training enables powerful performance in downstream tasks such as discrimination of a variety of objects and uncertainty mitigation. In addition to our results, we make the RS3L dataset publicly available for further studies on how to improve SSL strategies.
title	Re-Simulation-based Self-Supervised Learning for Pre-Training Foundation Models
topic	High Energy Physics - Phenomenology Machine Learning High Energy Physics - Experiment
url	https://arxiv.org/abs/2403.07066

Documenti analoghi