Saved in:
| Main Authors: | Radtke, Pawel K., Barrera-Hinojosa, Cristian G., Ivkovic, Mladen, Weinzierl, Tobias |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.06095 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Compiler-supported reduced precision and AoS-SoA transformations for heterogeneous hardware
by: Radtke, Pawel K., et al.
Published: (2025)
by: Radtke, Pawel K., et al.
Published: (2025)
Annotation-guided AoS-to-SoA conversions and GPU offloading with data views in C++
by: Radtke, Pawel K., et al.
Published: (2025)
by: Radtke, Pawel K., et al.
Published: (2025)
SYCL compute kernels for ExaHyPE
by: Loi, Chung Ming, et al.
Published: (2023)
by: Loi, Chung Ming, et al.
Published: (2023)
Annotation‐Guided AoS‐to‐SoA Conversions and GPU Offloading With Data Views in C++
by: Pawel K. Radtke, et al.
Published: (2025)
by: Pawel K. Radtke, et al.
Published: (2025)
A shared compilation stack for distributed-memory parallelism in stencil DSLs
by: Bisbas, George, et al.
Published: (2024)
by: Bisbas, George, et al.
Published: (2024)
Fast Higher-Order Interpolation and Restriction in ExaHyPE Avoiding Non-physical Reflections
by: Stokes, Timothy, et al.
Published: (2025)
by: Stokes, Timothy, et al.
Published: (2025)
Compiler support for semi-manual AoS-to-SoA conversions with data views
by: Radtke, Pawel K., et al.
Published: (2024)
by: Radtke, Pawel K., et al.
Published: (2024)
Distributed-memory Algorithms for Sparse Matrix Permutation, Extraction, and Assignment
by: Hassani, Elaheh, et al.
Published: (2025)
by: Hassani, Elaheh, et al.
Published: (2025)
Performance measurements of modern Fortran MPI applications with Score-P
by: Corbin, Gregor
Published: (2025)
by: Corbin, Gregor
Published: (2025)
Automated MPI-X code generation for scalable finite-difference solvers
by: Bisbas, George, et al.
Published: (2023)
by: Bisbas, George, et al.
Published: (2023)
On the Challenges of Energy-Efficiency Analysis in HPC Systems: Evaluating Synthetic Benchmarks and Gromacs
by: Machado, Rafael Ravedutti Lucio, et al.
Published: (2025)
by: Machado, Rafael Ravedutti Lucio, et al.
Published: (2025)
Enabling MPI communication within Numba/LLVM JIT-compiled Python code using numba-mpi v1.0
by: Derlatka, Kacper, et al.
Published: (2024)
by: Derlatka, Kacper, et al.
Published: (2024)
TTK is Getting MPI-Ready
by: Guillou, Eve Le, et al.
Published: (2023)
by: Guillou, Eve Le, et al.
Published: (2023)
Experience converting a large mathematical software package written in C++ to C++20 modules
by: Bangerth, Wolfgang
Published: (2025)
by: Bangerth, Wolfgang
Published: (2025)
Towards a user-centric HPC-QC environment
by: Wennersteen, Aleksander, et al.
Published: (2025)
by: Wennersteen, Aleksander, et al.
Published: (2025)
PennyLane-Lightning MPI: A massively scalable quantum circuit simulator based on distributed computing in CPU clusters
by: Kang, Ji-Hoon, et al.
Published: (2025)
by: Kang, Ji-Hoon, et al.
Published: (2025)
TumorTwin: A python framework for patient-specific digital twins in oncology
by: Kapteyn, Michael, et al.
Published: (2025)
by: Kapteyn, Michael, et al.
Published: (2025)
svds-C: A Multi-Thread C Code for Computing Truncated Singular Value Decomposition
by: Feng, Xu, et al.
Published: (2024)
by: Feng, Xu, et al.
Published: (2024)
A modular and extensible library for parameterized terrain generation
by: Wallin, Erik
Published: (2025)
by: Wallin, Erik
Published: (2025)
SySTeC: A Symmetric Sparse Tensor Compiler
by: Patel, Radha, et al.
Published: (2024)
by: Patel, Radha, et al.
Published: (2024)
Efficient space-time reduced order model for linear dynamical systems in Python using less than 120 lines of code
by: Kim, Youngkyu, et al.
Published: (2020)
by: Kim, Youngkyu, et al.
Published: (2020)
QMCPy: A Python Software for Randomized Low-Discrepancy Sequences, Quasi-Monte Carlo, and Fast Kernel Methods
by: Sorokin, Aleksei G
Published: (2025)
by: Sorokin, Aleksei G
Published: (2025)
ExaGRyPE: Numerical General Relativity Solvers Based upon the Hyperbolic PDEs Solver Engine ExaHyPE
by: Zhang, Han, et al.
Published: (2024)
by: Zhang, Han, et al.
Published: (2024)
LLM-HPC++: Evaluating LLM-Generated Modern C++ and MPI+OpenMP Codes for Scalable Mandelbrot Set Computation
by: Diehl, Patrick, et al.
Published: (2025)
by: Diehl, Patrick, et al.
Published: (2025)
A comparison of two effective methods for reordering columns within supernodes
by: Karsavuran, M. Ozan, et al.
Published: (2025)
by: Karsavuran, M. Ozan, et al.
Published: (2025)
Enhancing non-Perl bioinformatic applications with Perl: Building novel, component based applications using Object Orientation, PDL, Alien, FFI, Inline and OpenMP
by: Argyropoulos, Christos
Published: (2024)
by: Argyropoulos, Christos
Published: (2024)
Towards Richer Challenge Problems for Scientific Computing Correctness
by: Sottile, Matthew, et al.
Published: (2025)
by: Sottile, Matthew, et al.
Published: (2025)
Deriving Algorithms for Triangular Tridiagonalization a Skew-Symmetric Matrix
by: van de Geijn, Robert, et al.
Published: (2023)
by: van de Geijn, Robert, et al.
Published: (2023)
Performant Tridiagonal Factorization of Skew-Symmetric Matrices
by: Satyarth, Ishna, et al.
Published: (2024)
by: Satyarth, Ishna, et al.
Published: (2024)
Porting the Nonlinear Optimization Library HiOp to Accelerator-Based Hardware Architectures
by: Peles, Slaven, et al.
Published: (2026)
by: Peles, Slaven, et al.
Published: (2026)
12 Labours tools for developing Functional Tissue Units
by: Hussan, Jagir R.
Published: (2024)
by: Hussan, Jagir R.
Published: (2024)
Interface for Sparse Linear Algebra Operations
by: Abdelfattah, Ahmad, et al.
Published: (2024)
by: Abdelfattah, Ahmad, et al.
Published: (2024)
Software Development Aspects of Integrating Linear Algebra Libraries
by: Koch, Marcel, et al.
Published: (2025)
by: Koch, Marcel, et al.
Published: (2025)
Type-II/III DCT/DST algorithms with reduced number of arithmetic operations
by: Shao, Xuancheng, et al.
Published: (2007)
by: Shao, Xuancheng, et al.
Published: (2007)
Compressing Structured Tensor Algebra
by: Ghorbani, Mahdi, et al.
Published: (2024)
by: Ghorbani, Mahdi, et al.
Published: (2024)
Basilisk and Docker for Reproducible GN&C Simulation: A Workflow Reference
by: Gupta, Anubhav
Published: (2026)
by: Gupta, Anubhav
Published: (2026)
AD-HOC: A C++ Expression Template package for high-order derivatives backpropagation
by: Rey, Juan Lucas
Published: (2024)
by: Rey, Juan Lucas
Published: (2024)
Transformations of Computational Meshes
by: Knepley, Matthew G.
Published: (2025)
by: Knepley, Matthew G.
Published: (2025)
Revealing Floating-Point Accumulation Orders in Software/Hardware Implementations
by: Xie, Peichen, et al.
Published: (2024)
by: Xie, Peichen, et al.
Published: (2024)
Minimization of Nonlinear Energies in Python Using FEM and Automatic Differentiation Tools
by: Béreš, Michal, et al.
Published: (2024)
by: Béreš, Michal, et al.
Published: (2024)
Similar Items
-
Compiler-supported reduced precision and AoS-SoA transformations for heterogeneous hardware
by: Radtke, Pawel K., et al.
Published: (2025) -
Annotation-guided AoS-to-SoA conversions and GPU offloading with data views in C++
by: Radtke, Pawel K., et al.
Published: (2025) -
SYCL compute kernels for ExaHyPE
by: Loi, Chung Ming, et al.
Published: (2023) -
Annotation‐Guided AoS‐to‐SoA Conversions and GPU Offloading With Data Views in C++
by: Pawel K. Radtke, et al.
Published: (2025) -
A shared compilation stack for distributed-memory parallelism in stencil DSLs
by: Bisbas, George, et al.
Published: (2024)