Saved in:
| Main Authors: | Wang, Haitian, Qin, Long |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.20173 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Comparative Study of OpenMP Scheduling Algorithm Selection Strategies
by: Korndörfer, Jonas H. Müller, et al.
Published: (2025)
by: Korndörfer, Jonas H. Müller, et al.
Published: (2025)
OMP-Engineer: Bridging Syntax Analysis and In-Context Learning for Efficient Automated OpenMP Parallelization
by: Wang, Weidong, et al.
Published: (2024)
by: Wang, Weidong, et al.
Published: (2024)
Parallel Paradigms in Modern HPC: A Comparative Analysis of MPI, OpenMP, and CUDA
by: ALHafez, Nizar, et al.
Published: (2025)
by: ALHafez, Nizar, et al.
Published: (2025)
Developing an Interactive OpenMP Programming Book with Large Language Models
by: Yi, Xinyao, et al.
Published: (2024)
by: Yi, Xinyao, et al.
Published: (2024)
Static Generation of Efficient OpenMP Offload Data Mappings
by: Marzen, Luke, et al.
Published: (2024)
by: Marzen, Luke, et al.
Published: (2024)
Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators
by: Fridman, Yehonatan, et al.
Published: (2024)
by: Fridman, Yehonatan, et al.
Published: (2024)
DiOMP-Offloading: Toward Portable Distributed Heterogeneous OpenMP
by: Shan, Baodi, et al.
Published: (2025)
by: Shan, Baodi, et al.
Published: (2025)
Implementing OpenMP for Zig to enable its use in HPC context
by: Kacs, David, et al.
Published: (2024)
by: Kacs, David, et al.
Published: (2024)
LLOR: Automated Repair of OpenMP Programs
by: Bora, Utpal, et al.
Published: (2024)
by: Bora, Utpal, et al.
Published: (2024)
Dynamic Detection of Inefficient Data Mapping Patterns in Heterogeneous OpenMP Applications
by: Marzen, Luke, et al.
Published: (2026)
by: Marzen, Luke, et al.
Published: (2026)
Auto-Tuning for OpenMP Dynamic Scheduling applied to Full Waveform Inversion
by: da Silva, Felipe H. S., et al.
Published: (2024)
by: da Silva, Felipe H. S., et al.
Published: (2024)
Multithreaded Fine-Grained Asynchronous BSP for Integer Sorting with LCI and OpenMP
by: Cheng, Minyu, et al.
Published: (2026)
by: Cheng, Minyu, et al.
Published: (2026)
Parallel FFTW on RISC-V: A Comparative Study including OpenMP, MPI, and HPX
by: Strack, Alexander, et al.
Published: (2025)
by: Strack, Alexander, et al.
Published: (2025)
Towards a Scalable and Efficient PGAS-based Distributed OpenMP
by: Shan, Baodi, et al.
Published: (2024)
by: Shan, Baodi, et al.
Published: (2024)
An MLIR pipeline for offloading Fortran to FPGAs via OpenMP
by: Rodriguez-Canal, Gabriel, et al.
Published: (2025)
by: Rodriguez-Canal, Gabriel, et al.
Published: (2025)
Detrimental task execution patterns in mainstream OpenMP runtimes
by: Tuft, Adam S., et al.
Published: (2024)
by: Tuft, Adam S., et al.
Published: (2024)
A Formal Semantics of C with OpenMP Parallelism (Extended Version)
by: Du, Ke, et al.
Published: (2026)
by: Du, Ke, et al.
Published: (2026)
Enhanced OpenMP Algorithm to Compute All-Pairs Shortest Path on x86 Architectures
by: Calderón, Sergio, et al.
Published: (2024)
by: Calderón, Sergio, et al.
Published: (2024)
Optimizing the Weather Research and Forecasting Model with OpenMP Offload and Codee
by: Chayanon, et al.
Published: (2024)
by: Chayanon, et al.
Published: (2024)
Porting HPC Applications to AMD Instinct$^\text{TM}$ MI300A Using Unified Memory and OpenMP
by: Tandon, Suyash, et al.
Published: (2024)
by: Tandon, Suyash, et al.
Published: (2024)
DWDP: Distributed Weight Data Parallelism for High-Performance LLM Inference on NVL72
by: Li, Wanqian, et al.
Published: (2026)
by: Li, Wanqian, et al.
Published: (2026)
OMP4Py: a pure Python implementation of OpenMP
by: Piñeiro, César, et al.
Published: (2024)
by: Piñeiro, César, et al.
Published: (2024)
Research on Model Parallelism and Data Parallelism Optimization Methods in Large Language Model-Based Recommendation Systems
by: Yang, Haowei, et al.
Published: (2025)
by: Yang, Haowei, et al.
Published: (2025)
Pragma driven shared memory parallelism in Zig by supporting OpenMP loop directives
by: Kacs, David, et al.
Published: (2024)
by: Kacs, David, et al.
Published: (2024)
Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors
by: Saez, Juan Carlos, et al.
Published: (2024)
by: Saez, Juan Carlos, et al.
Published: (2024)
HiDVFS: A Hierarchical Multi-Agent DVFS Scheduler for OpenMP DAG Workloads
by: Pivezhandi, Mohammad, et al.
Published: (2026)
by: Pivezhandi, Mohammad, et al.
Published: (2026)
Scaling Sample-Based Quantum Diagonalization on GPU-Accelerated Systems using OpenMP Offload
by: Walkup, Robert, et al.
Published: (2026)
by: Walkup, Robert, et al.
Published: (2026)
OMPGPT: A Generative Pre-trained Transformer Model for OpenMP
by: Chen, Le, et al.
Published: (2024)
by: Chen, Le, et al.
Published: (2024)
Accelerating Long-Tail Generation in Synchronous RLHF Training via Adaptive Tensor Parallelism
by: Zhao, Long, et al.
Published: (2026)
by: Zhao, Long, et al.
Published: (2026)
Integrating High Performance In-Memory Data Streaming and In-Situ Visualization in Hybrid MPI+OpenMP PIC MC Simulations Towards Exascale
by: Williams, Jeremy J., et al.
Published: (2025)
by: Williams, Jeremy J., et al.
Published: (2025)
Using Sequential Runtime Distributions for the Parallel Speedup Prediction of SAT Local Search
by: Arbelaez, Alejandro, et al.
Published: (2024)
by: Arbelaez, Alejandro, et al.
Published: (2024)
Accelerating Particle-in-Cell Monte Carlo Simulations with MPI, OpenMP/OpenACC and Asynchronous Multi-GPU Programming
by: Williams, Jeremy J., et al.
Published: (2024)
by: Williams, Jeremy J., et al.
Published: (2024)
Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
by: Zhu, Zhanda, et al.
Published: (2025)
by: Zhu, Zhanda, et al.
Published: (2025)
Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems
by: Sgambati, Matthew, et al.
Published: (2025)
by: Sgambati, Matthew, et al.
Published: (2025)
Rethinking Dynamic Networks and Heterogeneous Computing with Automatic Parallelization
by: Wu, Ruilong, et al.
Published: (2025)
by: Wu, Ruilong, et al.
Published: (2025)
Can Large Language Models Predict Parallel Code Performance?
by: Bolet, Gregory, et al.
Published: (2025)
by: Bolet, Gregory, et al.
Published: (2025)
LLM-HPC++: Evaluating LLM-Generated Modern C++ and MPI+OpenMP Codes for Scalable Mandelbrot Set Computation
by: Diehl, Patrick, et al.
Published: (2025)
by: Diehl, Patrick, et al.
Published: (2025)
Astra: Efficient and Money-saving Automatic Parallel Strategies Search on Heterogeneous GPUs
by: Wang, Peiran, et al.
Published: (2025)
by: Wang, Peiran, et al.
Published: (2025)
Efficient MoE Inference with Fine-Grained Scheduling of Disaggregated Expert Parallelism
by: Pan, Xinglin, et al.
Published: (2025)
by: Pan, Xinglin, et al.
Published: (2025)
Para-B&B: Load-Balanced Deterministic Parallelization of Solving MIP
by: Zhang, Jinyu, et al.
Published: (2026)
by: Zhang, Jinyu, et al.
Published: (2026)
Similar Items
-
A Comparative Study of OpenMP Scheduling Algorithm Selection Strategies
by: Korndörfer, Jonas H. Müller, et al.
Published: (2025) -
OMP-Engineer: Bridging Syntax Analysis and In-Context Learning for Efficient Automated OpenMP Parallelization
by: Wang, Weidong, et al.
Published: (2024) -
Parallel Paradigms in Modern HPC: A Comparative Analysis of MPI, OpenMP, and CUDA
by: ALHafez, Nizar, et al.
Published: (2025) -
Developing an Interactive OpenMP Programming Book with Large Language Models
by: Yi, Xinyao, et al.
Published: (2024) -
Static Generation of Efficient OpenMP Offload Data Mappings
by: Marzen, Luke, et al.
Published: (2024)