| Main Authors: | Jarmusch, Aaron, Cabarcas, Felipe, Pophale, Swaroop, Kallai, Andrew, Doerfert, Johannes, Peyralans, Luke, Lee, Seyong, Denny, Joel, Chandrasekaran, Sunita |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.11777 |
Similar Items
Microbenchmark-Driven Analytical Performance Modeling Across Modern GPU Architectures
by: Jarmusch, Aaron, et al.
Published: (2026)
Microbenchmarking NVIDIA's Blackwell Architecture: An in-depth Architectural Analysis
by: Jarmusch, Aaron, et al.
Published: (2025)
LLM4VV: Developing LLM-Driven Testsuite for Compiler Validation
by: Munley, Christian, et al.
Published: (2023)
LLM4VV: Exploring LLM-as-a-Judge for Validation and Verification Testsuites
by: Sollenberger, Zachariah, et al.
Published: (2024)
Dissecting the NVIDIA Blackwell Architecture with Microbenchmarks
by: Jarmusch, Aaron, et al.
Published: (2025)
Execution-Centric Characterization of FP8 Matrix Cores, Asynchronous Execution, and Structured Sparsity on AMD MI300A
by: Jarmusch, Aaron, et al.
Published: (2026)
Static Generation of Efficient OpenMP Offload Data Mappings
by: Marzen, Luke, et al.
Published: (2024)
Implementing OpenMP for Zig to enable its use in HPC context
by: Kacs, David, et al.
Published: (2024)
Dynamic Detection of Inefficient Data Mapping Patterns in Heterogeneous OpenMP Applications
by: Marzen, Luke, et al.
Published: (2026)
LLOR: Automated Repair of OpenMP Programs
by: Bora, Utpal, et al.
Published: (2024)
MPI-Rockstar: a Hybrid MPI and OpenMP Parallel Implementation of the Rockstar Halo finder
by: Tokuue, Tomoyuki, et al.
Published: (2024)
Auto-Tuning for OpenMP Dynamic Scheduling applied to Full Waveform Inversion
by: da Silva, Felipe H. S., et al.
Published: (2024)
An MLIR pipeline for offloading Fortran to FPGAs via OpenMP
by: Rodriguez-Canal, Gabriel, et al.
Published: (2025)
Detrimental task execution patterns in mainstream OpenMP runtimes
by: Tuft, Adam S., et al.
Published: (2024)
rcpptimer: Rcpp Tic-Toc Timer with OpenMP Support
by: Berrisch, Jonathan
Published: (2025)
Learning to Parallelize with OpenMP by Augmented Heterogeneous AST Representation
by: Chen, Le, et al.
Published: (2023)
Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators
by: Fridman, Yehonatan, et al.
Published: (2024)
DNA sequence alignment: An assignment for OpenMP, MPI, and CUDA/OpenCL
by: Gonzalez-Escribano, Arturo, et al.
Published: (2024)
DiOMP-Offloading: Toward Portable Distributed Heterogeneous OpenMP
by: Shan, Baodi, et al.
Published: (2025)
A Formal Semantics of C with OpenMP Parallelism (Extended Version)
by: Du, Ke, et al.
Published: (2026)
OMP4Py: a pure Python implementation of OpenMP
by: Piñeiro, César, et al.
Published: (2024)
OMPGPT: A Generative Pre-trained Transformer Model for OpenMP
by: Chen, Le, et al.
Published: (2024)
Optimizing the Weather Research and Forecasting Model with OpenMP Offload and Codee
by: Chayanon, et al.
Published: (2024)
A Comparative Study of OpenMP Scheduling Algorithm Selection Strategies
by: Korndörfer, Jonas H. Müller, et al.
Published: (2025)
Towards a Scalable and Efficient PGAS-based Distributed OpenMP
by: Shan, Baodi, et al.
Published: (2024)
A Microbenchmark Framework for Performance Evaluation of OpenMP Target Offloading
by: Atif, Mohammad, et al.
Published: (2025)
Developing an Interactive OpenMP Programming Book with Large Language Models
by: Yi, Xinyao, et al.
Published: (2024)
Scaling Sample-Based Quantum Diagonalization on GPU-Accelerated Systems using OpenMP Offload
by: Walkup, Robert, et al.
Published: (2026)
Solution of finite element problems using hybrid parallelization with MPI and OpenMP
by: Miguel Vargas-Félix
Published: (2012)
Accelerating cosmological simulations on GPUs: a portable approach using OpenMP
by: Lepinzan, M. D., et al.
Published: (2025)
Multithreaded Fine-Grained Asynchronous BSP for Integer Sorting with LCI and OpenMP
by: Cheng, Minyu, et al.
Published: (2026)
MPI vs OpenMP: A case study on parallel generation of Mandelbrot set
by: Ernesto Soto Gómez
Published: (2020)
GPU Acceleration and Portability of the TRIMEG Code for Gyrokinetic Plasma Simulations using OpenMP
by: Daneri, Giorgio
Published: (2026)
Parallel Paradigms in Modern HPC: A Comparative Analysis of MPI, OpenMP, and CUDA
by: ALHafez, Nizar, et al.
Published: (2025)
P4OMP: Retrieval-Augmented Prompting for OpenMP Parallelism in Serial Code
by: Abdullah, Wali Mohammad, et al.
Published: (2025)
Testing the Unknown: A Framework for OpenMP Testing via Random Program Generation
by: Laguna, Ignacio, et al.
Published: (2024)
Pragma driven shared memory parallelism in Zig by supporting OpenMP loop directives
by: Kacs, David, et al.
Published: (2024)
Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors
by: Saez, Juan Carlos, et al.
Published: (2024)
An OpenMP‐based breadth‐first search implementation using the bag data structure
by: S. L. Gonzaga de Oliveira, et al.
Published: (2024)
High-Performance Parallel Optimization of the Fish School Behaviour on the Setonix Platform Using OpenMP
by: Wang, Haitian, et al.
Published: (2025)