Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Xu, Chenwei, Hu, Jerry Yao-Chieh, Narayanan, Aakaash, Thieme, Mattson, Nagaslaev, Vladimir, Austin, Mark, Arnold, Jeremy, Berlioz, Jose, Hanlet, Pierrick, Ibrahim, Aisha, Nicklaus, Dennis, Mitrevski, Jovan, John, Jason Michael St., Pradhan, Gauri, Saewert, Andrea, Seiya, Kiyomi, Schupbach, Brian, Thurman-Keup, Randy, Tran, Nhan, Shi, Rui, Ogrenci, Seda, Shuping, Alexis Maya-Isabelle, Hazelwood, Kyle, Liu, Han
Format:	Preprint
Veröffentlicht:	2023
Schlagworte:	Machine Learning Artificial Intelligence Accelerator Physics
Online-Zugang:	https://arxiv.org/abs/2312.17372
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866913181053485056
author	Xu, Chenwei Hu, Jerry Yao-Chieh Narayanan, Aakaash Thieme, Mattson Nagaslaev, Vladimir Austin, Mark Arnold, Jeremy Berlioz, Jose Hanlet, Pierrick Ibrahim, Aisha Nicklaus, Dennis Mitrevski, Jovan John, Jason Michael St. Pradhan, Gauri Saewert, Andrea Seiya, Kiyomi Schupbach, Brian Thurman-Keup, Randy Tran, Nhan Shi, Rui Ogrenci, Seda Shuping, Alexis Maya-Isabelle Hazelwood, Kyle Liu, Han
author_facet	Xu, Chenwei Hu, Jerry Yao-Chieh Narayanan, Aakaash Thieme, Mattson Nagaslaev, Vladimir Austin, Mark Arnold, Jeremy Berlioz, Jose Hanlet, Pierrick Ibrahim, Aisha Nicklaus, Dennis Mitrevski, Jovan John, Jason Michael St. Pradhan, Gauri Saewert, Andrea Seiya, Kiyomi Schupbach, Brian Thurman-Keup, Randy Tran, Nhan Shi, Rui Ogrenci, Seda Shuping, Alexis Maya-Isabelle Hazelwood, Kyle Liu, Han
contents	We introduce a novel Proximal Policy Optimization (PPO) algorithm aimed at addressing the challenge of maintaining a uniform proton beam intensity delivery in the Muon to Electron Conversion Experiment (Mu2e) at Fermi National Accelerator Laboratory (Fermilab). Our primary objective is to regulate the spill process to ensure a consistent intensity profile, with the ultimate goal of creating an automated controller capable of providing real-time feedback and calibration of the Spill Regulation System (SRS) parameters on a millisecond timescale. We treat the Mu2e accelerator system as a Markov Decision Process suitable for Reinforcement Learning (RL), utilizing PPO to reduce bias and enhance training stability. A key innovation in our approach is the integration of a neuralized Proportional-Integral-Derivative (PID) controller into the policy function, resulting in a significant improvement in the Spill Duty Factor (SDF) by 13.6%, surpassing the performance of the current PID controller baseline by an additional 1.6%. This paper presents the preliminary offline results based on a differentiable simulator of the Mu2e accelerator. It paves the groundwork for real-time implementations and applications, representing a crucial step towards automated proton beam intensity control for the Mu2e experiment.
format	Preprint
id	arxiv_https___arxiv_org_abs_2312_17372
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e Xu, Chenwei Hu, Jerry Yao-Chieh Narayanan, Aakaash Thieme, Mattson Nagaslaev, Vladimir Austin, Mark Arnold, Jeremy Berlioz, Jose Hanlet, Pierrick Ibrahim, Aisha Nicklaus, Dennis Mitrevski, Jovan John, Jason Michael St. Pradhan, Gauri Saewert, Andrea Seiya, Kiyomi Schupbach, Brian Thurman-Keup, Randy Tran, Nhan Shi, Rui Ogrenci, Seda Shuping, Alexis Maya-Isabelle Hazelwood, Kyle Liu, Han Machine Learning Artificial Intelligence Accelerator Physics We introduce a novel Proximal Policy Optimization (PPO) algorithm aimed at addressing the challenge of maintaining a uniform proton beam intensity delivery in the Muon to Electron Conversion Experiment (Mu2e) at Fermi National Accelerator Laboratory (Fermilab). Our primary objective is to regulate the spill process to ensure a consistent intensity profile, with the ultimate goal of creating an automated controller capable of providing real-time feedback and calibration of the Spill Regulation System (SRS) parameters on a millisecond timescale. We treat the Mu2e accelerator system as a Markov Decision Process suitable for Reinforcement Learning (RL), utilizing PPO to reduce bias and enhance training stability. A key innovation in our approach is the integration of a neuralized Proportional-Integral-Derivative (PID) controller into the policy function, resulting in a significant improvement in the Spill Duty Factor (SDF) by 13.6%, surpassing the performance of the current PID controller baseline by an additional 1.6%. This paper presents the preliminary offline results based on a differentiable simulator of the Mu2e accelerator. It paves the groundwork for real-time implementations and applications, representing a crucial step towards automated proton beam intensity control for the Mu2e experiment.
title	Beyond PID Controllers: PPO with Neuralized PID Policy for Proton Beam Intensity Control in Mu2e
topic	Machine Learning Artificial Intelligence Accelerator Physics
url	https://arxiv.org/abs/2312.17372

Ähnliche Einträge