Saved in:
| Main Authors: | Ramirez-Hidalgo, Gustavo, He, Lianhua, Zhang, Ke-Long |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.08092 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Multiple right hand side multigrid for domain wall fermions with a multigrid preconditioned block conjugate gradient algorithm
by: Boyle, Peter A
Published: (2024)
by: Boyle, Peter A
Published: (2024)
Performance-Portable Optimization and Analysis of Multiple Right-Hand Sides in a Lattice QCD Solver
by: Long, Shiting, et al.
Published: (2026)
by: Long, Shiting, et al.
Published: (2026)
Accelerating Lattice QCD Simulations using GPUs
by: Matthaei, Tilmann
Published: (2024)
by: Matthaei, Tilmann
Published: (2024)
Portable, Massively Parallel Implementation of a Material Point Method for Compressible Flows
by: Baioni, Paolo Joseph, et al.
Published: (2024)
by: Baioni, Paolo Joseph, et al.
Published: (2024)
A simple GPU implementation of spectral-element methods for solving 3D Poisson type equations on rectangular domains and its applications
by: Liu, Xinyu, et al.
Published: (2023)
by: Liu, Xinyu, et al.
Published: (2023)
Energy Efficiency trends in HPC: what high-energy and astrophysicists need to know
by: Suarez, Estela, et al.
Published: (2025)
by: Suarez, Estela, et al.
Published: (2025)
Towards a GPU-Parallelization of the neXtSIM-DG Dynamical Core
by: Jendersie, Robert, et al.
Published: (2024)
by: Jendersie, Robert, et al.
Published: (2024)
Two-Stage Block Orthogonalization to Improve Performance of $s$-step GMRES
by: Yamazaki, Ichitaro, et al.
Published: (2024)
by: Yamazaki, Ichitaro, et al.
Published: (2024)
SUNDIALS Time Integrators for Exascale Applications with Many Independent ODE Systems
by: Balos, Cody J., et al.
Published: (2024)
by: Balos, Cody J., et al.
Published: (2024)
Neural Acceleration of Incomplete Cholesky Preconditioners
by: Booth, Joshua Dennis, et al.
Published: (2024)
by: Booth, Joshua Dennis, et al.
Published: (2024)
A Parallel in Time Algorithm Based on ParaExp for Optimal Control Problems
by: Kwok, Felix, et al.
Published: (2024)
by: Kwok, Felix, et al.
Published: (2024)
Cucheb: A GPU implementation of the filtered Lanczos procedure
by: Aurentz, Jared L., et al.
Published: (2024)
by: Aurentz, Jared L., et al.
Published: (2024)
GPU Accelerated Implicit Kinetic Meshfree Method based on Modified LU-SGS
by: Verma, Mayuri, et al.
Published: (2024)
by: Verma, Mayuri, et al.
Published: (2024)
CG-Kit: Code Generation Toolkit for Performant and Maintainable Variants of Source Code Applied to Flash-X Hydrodynamics Simulations
by: Rudi, Johann, et al.
Published: (2024)
by: Rudi, Johann, et al.
Published: (2024)
Error Analysis of Matrix Multiplication Emulation Using Ozaki-II Scheme
by: Uchino, Yuki, et al.
Published: (2026)
by: Uchino, Yuki, et al.
Published: (2026)
Parallel simulation and adaptive mesh refinement for 3D elastostatic contact mechanics problems between deformable bodies
by: Epalle, Alexandre, et al.
Published: (2025)
by: Epalle, Alexandre, et al.
Published: (2025)
Teaching An Old Dog New Tricks: Porting Legacy Code to Heterogeneous Compute Architectures With Automated Code Translation
by: Nytko, Nicolas, et al.
Published: (2025)
by: Nytko, Nicolas, et al.
Published: (2025)
On some orthogonalization schemes in Tensor Train format
by: Coulaud, Olivier, et al.
Published: (2022)
by: Coulaud, Olivier, et al.
Published: (2022)
Asymptotic Analysis of a Leader Election Algorithm
by: Lavault, Christian, et al.
Published: (2006)
by: Lavault, Christian, et al.
Published: (2006)
Random-sketching Techniques to Enhance the Numerical Stability of Block Orthogonalization Algorithms for s-step GMRES
by: Yamazaki, Ichitaro, et al.
Published: (2025)
by: Yamazaki, Ichitaro, et al.
Published: (2025)
RAPTOR: Practical Numerical Profiling of Scientific Applications
by: Hoerold, Faveo, et al.
Published: (2025)
by: Hoerold, Faveo, et al.
Published: (2025)
Residual-Weighted Randomized Jacobi: Sharpened Bounds via Residual Concentration and Asynchronous Extension
by: Coleman, Evan
Published: (2026)
by: Coleman, Evan
Published: (2026)
Algebraic Temporal Blocking for Sparse Iterative Solvers on Multi-Core CPUs
by: Alappat, Christie, et al.
Published: (2023)
by: Alappat, Christie, et al.
Published: (2023)
Modifying the Asynchronous Jacobi Method for Data Corruption Resilience
by: Vogl, Christopher J., et al.
Published: (2022)
by: Vogl, Christopher J., et al.
Published: (2022)
Randomized algorithms for distributed computation of principal component analysis and singular value decomposition
by: Li, Huamin, et al.
Published: (2016)
by: Li, Huamin, et al.
Published: (2016)
Efficient and scalable atmospheric dynamics simulations using non-conforming meshes
by: Orlando, Giuseppe, et al.
Published: (2024)
by: Orlando, Giuseppe, et al.
Published: (2024)
Real-time Bayesian inference at extreme scale: A digital twin for tsunami early warning applied to the Cascadia subduction zone
by: Henneking, Stefan, et al.
Published: (2025)
by: Henneking, Stefan, et al.
Published: (2025)
Improving the scalability of a high-order atmospheric dynamics solver based on the deal.II library
by: Orlando, Giuseppe, et al.
Published: (2025)
by: Orlando, Giuseppe, et al.
Published: (2025)
A High Performance GPU CountSketch Implementation and Its Application to Multisketching and Least Squares Problems
by: Higgins, Andrew J., et al.
Published: (2025)
by: Higgins, Andrew J., et al.
Published: (2025)
TTrace: Lightweight Error Checking and Diagnosis for Distributed Training
by: Jiang, Haitian, et al.
Published: (2025)
by: Jiang, Haitian, et al.
Published: (2025)
TriMe++: Multi-threaded triangular meshing in two dimensions
by: Lu, Jiayin, et al.
Published: (2023)
by: Lu, Jiayin, et al.
Published: (2023)
Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity
by: Chen, Haoxuan, et al.
Published: (2024)
by: Chen, Haoxuan, et al.
Published: (2024)
Distributed computing for physics-based data-driven reduced modeling at scale: Application to a rotating detonation rocket engine
by: Farcas, Ionut-Gabriel, et al.
Published: (2024)
by: Farcas, Ionut-Gabriel, et al.
Published: (2024)
Impact of EIP-4844 on Ethereum: Consensus Security, Ethereum Usage, Rollup Transaction Dynamics, and Blob Gas Fee Markets
by: Park, Seongwan, et al.
Published: (2024)
by: Park, Seongwan, et al.
Published: (2024)
SUperman: Efficient Permanent Computation on GPUs
by: Elbek, Deniz, et al.
Published: (2025)
by: Elbek, Deniz, et al.
Published: (2025)
Decomposing Solution Sets of Polynomial Systems: A New Parallel Monodromy Breakup Algorithm
by: Leykin, Anton, et al.
Published: (2005)
by: Leykin, Anton, et al.
Published: (2005)
A Distributed Block Chebyshev-Davidson Algorithm for Parallel Spectral Clustering
by: Pang, Qiyuan, et al.
Published: (2022)
by: Pang, Qiyuan, et al.
Published: (2022)
Parallelization of Software Systems Test Case Selection Algorithm Based on Singular Value Decomposition
by: Moghaddam, Mahdi Movahedian
Published: (2022)
by: Moghaddam, Mahdi Movahedian
Published: (2022)
DiffPhD: A Unified Differentiable Solver for Projective Heterogeneous Materials in Elastodynamics with Contact-Rich GPU-Acceleration
by: Lai, Shih-Yu, et al.
Published: (2026)
by: Lai, Shih-Yu, et al.
Published: (2026)
Distributed Matrix-Vector Multiplication: A Convolutional Coding Approach
by: Das, Anindya Bijoy, et al.
Published: (2019)
by: Das, Anindya Bijoy, et al.
Published: (2019)
Similar Items
-
Multiple right hand side multigrid for domain wall fermions with a multigrid preconditioned block conjugate gradient algorithm
by: Boyle, Peter A
Published: (2024) -
Performance-Portable Optimization and Analysis of Multiple Right-Hand Sides in a Lattice QCD Solver
by: Long, Shiting, et al.
Published: (2026) -
Accelerating Lattice QCD Simulations using GPUs
by: Matthaei, Tilmann
Published: (2024) -
Portable, Massively Parallel Implementation of a Material Point Method for Compressible Flows
by: Baioni, Paolo Joseph, et al.
Published: (2024) -
A simple GPU implementation of spectral-element methods for solving 3D Poisson type equations on rectangular domains and its applications
by: Liu, Xinyu, et al.
Published: (2023)