Saved in:
Bibliographic Details
Main Authors: Hong, Wensheng, Yang, Xiaohu, Li, Junde, Wang, Huiyuan, Chen, Zhao, Zhu, Hong-Ming, Li, Qingyang, Gu, Yizhou, Zhang, Youcai, Shi, Feng, Han, Jiaxin, Yu, Yu, Zhai, Zhongxu
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.06463
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • We present a highly scalable, MPI-parallelized framework for reconstructing the initial cosmic density field, designed to meet the computational demands of next-generation cosmological simulations, particularly the upcoming ELUCID-DESI simulation based on DESI BGS data. Building upon the Hamiltonian Monte Carlo approach and the FastPM solver, our code employs domain decomposition to efficiently distribute memory between nodes. Although communication overhead increases the per-step runtime of the MPI version by roughly a factor of eight relative to the shared-memory implementation, our scaling tests-spanning different particle numbers, core counts, and node layouts-show nearly linear scaling with respect to both the number of particles and the number of CPU cores. Furthermore, to significantly reduce computational costs during the initial burn-in phase, we introduce a novel ``guess'' module that rapidly generates a high-quality initial density field. The results of the simulation test confirm substantial efficiency gains: for $256^3$ particles, 53 steps ($\sim$54 CPU hours) are saved; for $1024^3$, 106 steps ($\sim$7500 CPU hours). The relative gain grows with the number of particles, rendering large-volume reconstructions computationally practical for upcoming surveys, including our planned ELUCID-DESI reconstruction simulation with $8192^3$ particles, with a rough estimation of 720 steps ($\sim$37,000,000 CPU hours).