Saved in:
| Main Authors: | Wang, Weidong, Zhu, Haoran |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.03215 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DiOMP-Offloading: Toward Portable Distributed Heterogeneous OpenMP
by: Shan, Baodi, et al.
Published: (2025)
by: Shan, Baodi, et al.
Published: (2025)
OMP4Py: a pure Python implementation of OpenMP
by: Piñeiro, César, et al.
Published: (2024)
by: Piñeiro, César, et al.
Published: (2024)
Parallel Paradigms in Modern HPC: A Comparative Analysis of MPI, OpenMP, and CUDA
by: ALHafez, Nizar, et al.
Published: (2025)
by: ALHafez, Nizar, et al.
Published: (2025)
LLOR: Automated Repair of OpenMP Programs
by: Bora, Utpal, et al.
Published: (2024)
by: Bora, Utpal, et al.
Published: (2024)
Static Generation of Efficient OpenMP Offload Data Mappings
by: Marzen, Luke, et al.
Published: (2024)
by: Marzen, Luke, et al.
Published: (2024)
Developing an Interactive OpenMP Programming Book with Large Language Models
by: Yi, Xinyao, et al.
Published: (2024)
by: Yi, Xinyao, et al.
Published: (2024)
Towards a Scalable and Efficient PGAS-based Distributed OpenMP
by: Shan, Baodi, et al.
Published: (2024)
by: Shan, Baodi, et al.
Published: (2024)
Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators
by: Fridman, Yehonatan, et al.
Published: (2024)
by: Fridman, Yehonatan, et al.
Published: (2024)
Implementing OpenMP for Zig to enable its use in HPC context
by: Kacs, David, et al.
Published: (2024)
by: Kacs, David, et al.
Published: (2024)
High-Performance Parallel Optimization of the Fish School Behaviour on the Setonix Platform Using OpenMP
by: Wang, Haitian, et al.
Published: (2025)
by: Wang, Haitian, et al.
Published: (2025)
Auto-Tuning for OpenMP Dynamic Scheduling applied to Full Waveform Inversion
by: da Silva, Felipe H. S., et al.
Published: (2024)
by: da Silva, Felipe H. S., et al.
Published: (2024)
Dynamic Detection of Inefficient Data Mapping Patterns in Heterogeneous OpenMP Applications
by: Marzen, Luke, et al.
Published: (2026)
by: Marzen, Luke, et al.
Published: (2026)
Multithreaded Fine-Grained Asynchronous BSP for Integer Sorting with LCI and OpenMP
by: Cheng, Minyu, et al.
Published: (2026)
by: Cheng, Minyu, et al.
Published: (2026)
Parallel FFTW on RISC-V: A Comparative Study including OpenMP, MPI, and HPX
by: Strack, Alexander, et al.
Published: (2025)
by: Strack, Alexander, et al.
Published: (2025)
Detrimental task execution patterns in mainstream OpenMP runtimes
by: Tuft, Adam S., et al.
Published: (2024)
by: Tuft, Adam S., et al.
Published: (2024)
An MLIR pipeline for offloading Fortran to FPGAs via OpenMP
by: Rodriguez-Canal, Gabriel, et al.
Published: (2025)
by: Rodriguez-Canal, Gabriel, et al.
Published: (2025)
A Formal Semantics of C with OpenMP Parallelism (Extended Version)
by: Du, Ke, et al.
Published: (2026)
by: Du, Ke, et al.
Published: (2026)
Enhanced OpenMP Algorithm to Compute All-Pairs Shortest Path on x86 Architectures
by: Calderón, Sergio, et al.
Published: (2024)
by: Calderón, Sergio, et al.
Published: (2024)
Porting HPC Applications to AMD Instinct$^\text{TM}$ MI300A Using Unified Memory and OpenMP
by: Tandon, Suyash, et al.
Published: (2024)
by: Tandon, Suyash, et al.
Published: (2024)
Pragma driven shared memory parallelism in Zig by supporting OpenMP loop directives
by: Kacs, David, et al.
Published: (2024)
by: Kacs, David, et al.
Published: (2024)
Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors
by: Saez, Juan Carlos, et al.
Published: (2024)
by: Saez, Juan Carlos, et al.
Published: (2024)
Optimizing the Weather Research and Forecasting Model with OpenMP Offload and Codee
by: Chayanon, et al.
Published: (2024)
by: Chayanon, et al.
Published: (2024)
Scaling Sample-Based Quantum Diagonalization on GPU-Accelerated Systems using OpenMP Offload
by: Walkup, Robert, et al.
Published: (2026)
by: Walkup, Robert, et al.
Published: (2026)
OMPGPT: A Generative Pre-trained Transformer Model for OpenMP
by: Chen, Le, et al.
Published: (2024)
by: Chen, Le, et al.
Published: (2024)
Accelerating Particle-in-Cell Monte Carlo Simulations with MPI, OpenMP/OpenACC and Asynchronous Multi-GPU Programming
by: Williams, Jeremy J., et al.
Published: (2024)
by: Williams, Jeremy J., et al.
Published: (2024)
A Comparative Study of OpenMP Scheduling Algorithm Selection Strategies
by: Korndörfer, Jonas H. Müller, et al.
Published: (2025)
by: Korndörfer, Jonas H. Müller, et al.
Published: (2025)
LLM-HPC++: Evaluating LLM-Generated Modern C++ and MPI+OpenMP Codes for Scalable Mandelbrot Set Computation
by: Diehl, Patrick, et al.
Published: (2025)
by: Diehl, Patrick, et al.
Published: (2025)
DNA sequence alignment: An assignment for OpenMP, MPI, and CUDA/OpenCL
by: Gonzalez-Escribano, Arturo, et al.
Published: (2024)
by: Gonzalez-Escribano, Arturo, et al.
Published: (2024)
MP-SL: Multihop Parallel Split Learning
by: Tirana, Joana, et al.
Published: (2024)
by: Tirana, Joana, et al.
Published: (2024)
LB4OMP: A Dynamic Load Balancing Library for Multithreaded Applications
by: Korndörfer, Jonas H. Müller, et al.
Published: (2021)
by: Korndörfer, Jonas H. Müller, et al.
Published: (2021)
Integrating High Performance In-Memory Data Streaming and In-Situ Visualization in Hybrid MPI+OpenMP PIC MC Simulations Towards Exascale
by: Williams, Jeremy J., et al.
Published: (2025)
by: Williams, Jeremy J., et al.
Published: (2025)
Performant Automatic BLAS Offloading on Unified Memory Architecture with OpenMP First-Touch Style Data Movement
by: Li, Junjie
Published: (2024)
by: Li, Junjie
Published: (2024)
LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism
by: Gu, Diandian, et al.
Published: (2024)
by: Gu, Diandian, et al.
Published: (2024)
HAP: Hybrid Adaptive Parallelism for Efficient Mixture-of-Experts Inference
by: Lin, Haoran, et al.
Published: (2025)
by: Lin, Haoran, et al.
Published: (2025)
StarTrail: Concentric Ring Sequence Parallelism for Efficient Near-Infinite-Context Transformer Model Training
by: Liu, Ziming, et al.
Published: (2024)
by: Liu, Ziming, et al.
Published: (2024)
Automated Programmatic Performance Analysis of Parallel Programs
by: Cankur, Onur, et al.
Published: (2024)
by: Cankur, Onur, et al.
Published: (2024)
GPU Acceleration and Portability of the TRIMEG Code for Gyrokinetic Plasma Simulations using OpenMP
by: Daneri, Giorgio
Published: (2026)
by: Daneri, Giorgio
Published: (2026)
SiPipe: Bridging the CPU-GPU Utilization Gap for Efficient Pipeline-Parallel LLM Inference
by: He, Yongchao, et al.
Published: (2025)
by: He, Yongchao, et al.
Published: (2025)
NanoCP: Request-Level Dynamic Context Parallelism for Data-Expert Parallel Decoding
by: Chen, Jiefei, et al.
Published: (2026)
by: Chen, Jiefei, et al.
Published: (2026)
HARP: Orchestrating Automated Parallel Training on Heterogeneous GPU Clusters
by: Liang, Antian, et al.
Published: (2025)
by: Liang, Antian, et al.
Published: (2025)
Similar Items
-
DiOMP-Offloading: Toward Portable Distributed Heterogeneous OpenMP
by: Shan, Baodi, et al.
Published: (2025) -
OMP4Py: a pure Python implementation of OpenMP
by: Piñeiro, César, et al.
Published: (2024) -
Parallel Paradigms in Modern HPC: A Comparative Analysis of MPI, OpenMP, and CUDA
by: ALHafez, Nizar, et al.
Published: (2025) -
LLOR: Automated Repair of OpenMP Programs
by: Bora, Utpal, et al.
Published: (2024) -
Static Generation of Efficient OpenMP Offload Data Mappings
by: Marzen, Luke, et al.
Published: (2024)