Saved in:
Bibliographic Details
Main Authors: Thaler, Felix, Keller, Sebastian
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.19873
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917288673804288
author Thaler, Felix
Keller, Sebastian
author_facet Thaler, Felix
Keller, Sebastian
contents We have developed a compressed neighbor list for short-range particle-particle interaction based on a space- filling curve (SFC) memory layout and particle clusters. The neighbor list can be constructed efficiently on GPUs, supporting NVIDIA and AMD hardware, and has a memory footprint of only 4 bytes per particle to store approximately 200 neighbors. Compared to the highly-optimized domain-specific neighbor list implementation of GROMACS, a molecular dynamics code, it has a comparable cluster overhead and delivers similar performance in a neighborhood pass. Thanks to the SFC-based data layout and the support for varying interaction radii per particle, our neighbor list performs well for systems with high density contrasts, such as those encountered in many astrophysical and cosmological applications. Due to the close relation between SFCs and octrees, our neighbor list seamlessly integrates with octree-based domain decomposition and multipole-based methods for long-range gravitational or electrostatic interactions. To demonstrate the coupling between long- and short-range forces, we simulate an Evrard collapse, a standard test case for the coupling between hydrodynamical and gravitational forces, on up to 1024 GPUs, and compare our results to the analytical solution.
format Preprint
id arxiv_https___arxiv_org_abs_2602_19873
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle GPU-Native Compressed Neighbor Lists with a Space-Filling-Curve Data Layout
Thaler, Felix
Keller, Sebastian
Computational Engineering, Finance, and Science
Instrumentation and Methods for Astrophysics
Data Structures and Algorithms
68W10
J.2
We have developed a compressed neighbor list for short-range particle-particle interaction based on a space- filling curve (SFC) memory layout and particle clusters. The neighbor list can be constructed efficiently on GPUs, supporting NVIDIA and AMD hardware, and has a memory footprint of only 4 bytes per particle to store approximately 200 neighbors. Compared to the highly-optimized domain-specific neighbor list implementation of GROMACS, a molecular dynamics code, it has a comparable cluster overhead and delivers similar performance in a neighborhood pass. Thanks to the SFC-based data layout and the support for varying interaction radii per particle, our neighbor list performs well for systems with high density contrasts, such as those encountered in many astrophysical and cosmological applications. Due to the close relation between SFCs and octrees, our neighbor list seamlessly integrates with octree-based domain decomposition and multipole-based methods for long-range gravitational or electrostatic interactions. To demonstrate the coupling between long- and short-range forces, we simulate an Evrard collapse, a standard test case for the coupling between hydrodynamical and gravitational forces, on up to 1024 GPUs, and compare our results to the analytical solution.
title GPU-Native Compressed Neighbor Lists with a Space-Filling-Curve Data Layout
topic Computational Engineering, Finance, and Science
Instrumentation and Methods for Astrophysics
Data Structures and Algorithms
68W10
J.2
url https://arxiv.org/abs/2602.19873