Saved in:
Bibliographic Details
Main Authors: Ying, Jie, Lin, Haowei, Yue, Chao, Chen, Yajie, Xiao, Chao, Shi, Quanqi, Liang, Yitao, Yau, Shing-Tung, Zhou, Yuan, Ma, Jianzhu
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.07994
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915584218759168
author Ying, Jie
Lin, Haowei
Yue, Chao
Chen, Yajie
Xiao, Chao
Shi, Quanqi
Liang, Yitao
Yau, Shing-Tung
Zhou, Yuan
Ma, Jianzhu
author_facet Ying, Jie
Lin, Haowei
Yue, Chao
Chen, Yajie
Xiao, Chao
Shi, Quanqi
Liang, Yitao
Yau, Shing-Tung
Zhou, Yuan
Ma, Jianzhu
contents In this study, we unveil a new AI model, termed PhyE2E, to discover physical formulas through symbolic regression. PhyE2E simplifies symbolic regression by decomposing it into sub-problems using the second-order derivatives of an oracle neural network, and employs a transformer model to translate data into symbolic formulas in an end-to-end manner. The resulting formulas are refined through Monte-Carlo Tree Search and Genetic Programming. We leverage a large language model to synthesize extensive symbolic expressions resembling real physics, and train the model to recover these formulas directly from data. A comprehensive evaluation reveals that PhyE2E outperforms existing state-of-the-art approaches, delivering superior symbolic accuracy, precision in data fitting, and consistency in physical units. We deployed PhyE2E to five applications in space physics, including the prediction of sunspot numbers, solar rotational angular velocity, emission line contribution functions, near-Earth plasma pressure, and lunar-tide plasma signals. The physical formulas generated by AI demonstrate a high degree of accuracy in fitting the experimental data from satellites and astronomical telescopes. We have successfully upgraded the formula proposed by NASA in 1993 regarding solar activity, and for the first time, provided the explanations for the long cycle of solar activity in an explicit form. We also found that the decay of near-Earth plasma pressure is proportional to r^2 to Earth, where subsequent mathematical derivations are consistent with satellite data from another independent study. Moreover, we found physical formulas that can describe the relationships between emission lines in the extreme ultraviolet spectrum of the Sun, temperatures, electron densities, and magnetic fields. The formula obtained is consistent with the properties that physicists had previously hypothesized it should possess.
format Preprint
id arxiv_https___arxiv_org_abs_2503_07994
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle A Neural Symbolic Model for Space Physics
Ying, Jie
Lin, Haowei
Yue, Chao
Chen, Yajie
Xiao, Chao
Shi, Quanqi
Liang, Yitao
Yau, Shing-Tung
Zhou, Yuan
Ma, Jianzhu
Solar and Stellar Astrophysics
Earth and Planetary Astrophysics
Instrumentation and Methods for Astrophysics
Artificial Intelligence
Space Physics
In this study, we unveil a new AI model, termed PhyE2E, to discover physical formulas through symbolic regression. PhyE2E simplifies symbolic regression by decomposing it into sub-problems using the second-order derivatives of an oracle neural network, and employs a transformer model to translate data into symbolic formulas in an end-to-end manner. The resulting formulas are refined through Monte-Carlo Tree Search and Genetic Programming. We leverage a large language model to synthesize extensive symbolic expressions resembling real physics, and train the model to recover these formulas directly from data. A comprehensive evaluation reveals that PhyE2E outperforms existing state-of-the-art approaches, delivering superior symbolic accuracy, precision in data fitting, and consistency in physical units. We deployed PhyE2E to five applications in space physics, including the prediction of sunspot numbers, solar rotational angular velocity, emission line contribution functions, near-Earth plasma pressure, and lunar-tide plasma signals. The physical formulas generated by AI demonstrate a high degree of accuracy in fitting the experimental data from satellites and astronomical telescopes. We have successfully upgraded the formula proposed by NASA in 1993 regarding solar activity, and for the first time, provided the explanations for the long cycle of solar activity in an explicit form. We also found that the decay of near-Earth plasma pressure is proportional to r^2 to Earth, where subsequent mathematical derivations are consistent with satellite data from another independent study. Moreover, we found physical formulas that can describe the relationships between emission lines in the extreme ultraviolet spectrum of the Sun, temperatures, electron densities, and magnetic fields. The formula obtained is consistent with the properties that physicists had previously hypothesized it should possess.
title A Neural Symbolic Model for Space Physics
topic Solar and Stellar Astrophysics
Earth and Planetary Astrophysics
Instrumentation and Methods for Astrophysics
Artificial Intelligence
Space Physics
url https://arxiv.org/abs/2503.07994