Saved in:
| Main Authors: | Wu, Qiying, Zolnikov, Pavel |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.00601 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
NPB-Rust: NAS Parallel Benchmarks in Rust
by: Martins, Eduardo M., et al.
Published: (2025)
by: Martins, Eduardo M., et al.
Published: (2025)
Extending Contract Verification for Parallel Programming Models to Fortran
by: Oraji, Yussur Mustafa, et al.
Published: (2026)
by: Oraji, Yussur Mustafa, et al.
Published: (2026)
KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads
by: Guan, Yue, et al.
Published: (2025)
by: Guan, Yue, et al.
Published: (2025)
Verifying Properties of Index Arrays in a Purely-Functional Data-Parallel Language
by: Hinnerskov, Nikolaj Hey, et al.
Published: (2025)
by: Hinnerskov, Nikolaj Hey, et al.
Published: (2025)
Same Engine, Multiple Gears: Parallelizing Fixpoint Iteration at Different Granularities (Extended Version)
by: Kocal, Ali Rasim, et al.
Published: (2026)
by: Kocal, Ali Rasim, et al.
Published: (2026)
Virtual Garbage Collector (VGC): A Zone-Based Garbage Collection Architecture for Python's Parallel Runtime
by: M, Abdulla
Published: (2025)
by: M, Abdulla
Published: (2025)
Comparing Parallel Functional Array Languages: Programming and Performance
by: van Balen, David, et al.
Published: (2025)
by: van Balen, David, et al.
Published: (2025)
Developing a Modular Compiler for a Subset of a C-like Language
by: Dutta, Debasish, et al.
Published: (2025)
by: Dutta, Debasish, et al.
Published: (2025)
Mirage Persistent Kernel: A Compiler and Runtime for Mega-Kernelizing Tensor Programs
by: Cheng, Xinhao, et al.
Published: (2025)
by: Cheng, Xinhao, et al.
Published: (2025)
VTC: DNN Compilation with Virtual Tensors for Data Movement Elimination
by: Hu, Muyan, et al.
Published: (2026)
by: Hu, Muyan, et al.
Published: (2026)
Event Tensor: A Unified Abstraction for Compiling Dynamic Megakernel
by: Jin, Hongyi, et al.
Published: (2026)
by: Jin, Hongyi, et al.
Published: (2026)
LOOPer: A Learned Automatic Code Optimizer For Polyhedral Compilers
by: Merouani, Massinissa, et al.
Published: (2024)
by: Merouani, Massinissa, et al.
Published: (2024)
Theoretical Foundations of GPU-Native Compilation for Rapid Code Iteration
by: Metinov, Adilet, et al.
Published: (2025)
by: Metinov, Adilet, et al.
Published: (2025)
Parallel Scan on Ascend AI Accelerators
by: Wróblewski, Bartłomiej, et al.
Published: (2025)
by: Wróblewski, Bartłomiej, et al.
Published: (2025)
pSTL-Bench: A Micro-Benchmark Suite for Assessing Scalability of C++ Parallel STL Implementations
by: Laso, Ruben, et al.
Published: (2024)
by: Laso, Ruben, et al.
Published: (2024)
Compiler-supported reduced precision and AoS-SoA transformations for heterogeneous hardware
by: Radtke, Pawel K., et al.
Published: (2025)
by: Radtke, Pawel K., et al.
Published: (2025)
An MLIR Lowering Pipeline for Stencils at Wafer-Scale
by: Stawinoga, Nicolai, et al.
Published: (2026)
by: Stawinoga, Nicolai, et al.
Published: (2026)
An MLIR pipeline for offloading Fortran to FPGAs via OpenMP
by: Rodriguez-Canal, Gabriel, et al.
Published: (2025)
by: Rodriguez-Canal, Gabriel, et al.
Published: (2025)
Scaling Deep Learning Training with MPMD Pipeline Parallelism
by: Xhebraj, Anxhelo, et al.
Published: (2024)
by: Xhebraj, Anxhelo, et al.
Published: (2024)
PopPy: Opportunistically Exploiting Parallelism in Python Compound AI Applications
by: Mell, Stephen, et al.
Published: (2026)
by: Mell, Stephen, et al.
Published: (2026)
From Prompts to Performance: Evaluating LLMs for Task-based Parallel Code Generation
by: Bantel, Linus, et al.
Published: (2026)
by: Bantel, Linus, et al.
Published: (2026)
RVISmith: Fuzzing Compilers for RVV Intrinsics
by: He, Yibo, et al.
Published: (2025)
by: He, Yibo, et al.
Published: (2025)
Multi-Relational Algebra for Multi-Granular Data Analytics
by: Wu, Xi, et al.
Published: (2023)
by: Wu, Xi, et al.
Published: (2023)
A Parallel Scan Algorithm in the Tensor Core Unit Model
by: Zouzias, Anastasios, et al.
Published: (2024)
by: Zouzias, Anastasios, et al.
Published: (2024)
Sal: Multi-modal Verification of Replicated Data Types
by: Ramesh, Pranav, et al.
Published: (2026)
by: Ramesh, Pranav, et al.
Published: (2026)
Evaluating SYCL as a Unified Programming Model for Heterogeneous Systems
by: Marowka, Ami
Published: (2026)
by: Marowka, Ami
Published: (2026)
Mat2Boundary: Treating User-Defined Boundary Condition as SpMV for Distributed PDE Solvers on Block-Structured Grids
by: Cai, Yanzheng, et al.
Published: (2026)
by: Cai, Yanzheng, et al.
Published: (2026)
Branching Out: Existential External Choice in Effpi
by: Robinson, Benjamin, et al.
Published: (2026)
by: Robinson, Benjamin, et al.
Published: (2026)
Publish on Ping: A Better Way to Publish Reservations in Memory Reclamation for Concurrent Data Structures
by: Singh, Ajay, et al.
Published: (2025)
by: Singh, Ajay, et al.
Published: (2025)
Timetide: A programming model for logically synchronous distributed systems
by: Kenwright, Logan, et al.
Published: (2025)
by: Kenwright, Logan, et al.
Published: (2025)
Hydra: Virtualized Multi-Language Runtime for High-Density Serverless Platforms
by: Ivanenko, Serhii, et al.
Published: (2022)
by: Ivanenko, Serhii, et al.
Published: (2022)
MCFuser: High-Performance and Rapid Fusion of Memory-Bound Compute-Intensive Operators
by: Zhang, Zheng, et al.
Published: (2025)
by: Zhang, Zheng, et al.
Published: (2025)
Assessing Opportunities of SYCL for Biological Sequence Alignment on GPU-based Systems
by: Costanzo, Manuel, et al.
Published: (2022)
by: Costanzo, Manuel, et al.
Published: (2022)
Flo: a Semantic Foundation for Progressive Stream Processing
by: Laddad, Shadaj, et al.
Published: (2024)
by: Laddad, Shadaj, et al.
Published: (2024)
Choreographies as Macros
by: Bohosian, Alexander, et al.
Published: (2025)
by: Bohosian, Alexander, et al.
Published: (2025)
OMP4Py: a pure Python implementation of OpenMP
by: Piñeiro, César, et al.
Published: (2024)
by: Piñeiro, César, et al.
Published: (2024)
Suki: Choreographed Distributed Dataflow in Rust
by: Laddad, Shadaj, et al.
Published: (2024)
by: Laddad, Shadaj, et al.
Published: (2024)
Towards a Function-as-a-Service Choreographic Programming Language: Examples and Applications
by: De Palma, Giuseppe, et al.
Published: (2024)
by: De Palma, Giuseppe, et al.
Published: (2024)
On the Duality of Task and Actor Programming Models
by: Yadav, Rohan, et al.
Published: (2025)
by: Yadav, Rohan, et al.
Published: (2025)
GuStL - An Experimental Guarded States Language
by: Schirmer, Oskar
Published: (2016)
by: Schirmer, Oskar
Published: (2016)
Similar Items
-
NPB-Rust: NAS Parallel Benchmarks in Rust
by: Martins, Eduardo M., et al.
Published: (2025) -
Extending Contract Verification for Parallel Programming Models to Fortran
by: Oraji, Yussur Mustafa, et al.
Published: (2026) -
KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads
by: Guan, Yue, et al.
Published: (2025) -
Verifying Properties of Index Arrays in a Purely-Functional Data-Parallel Language
by: Hinnerskov, Nikolaj Hey, et al.
Published: (2025) -
Same Engine, Multiple Gears: Parallelizing Fixpoint Iteration at Different Granularities (Extended Version)
by: Kocal, Ali Rasim, et al.
Published: (2026)