| Main Authors: | Homola, Jakub, Vavřík, Radim, Meca, Ondřej, Brzobohatý, Tomáš, Říha, Lubomír |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2502.08382 |
Similar Items
Utilizing Sparsity in the GPU-accelerated Assembly of Schur Complement Matrices in Domain Decomposition Methods
by: Homola, Jakub, et al.
Published: (2025)
Bandicoot: A Templated C++ Library for GPU Linear Algebra
by: Curtin, Ryan R., et al.
Published: (2025)
Local Adjoints for Simultaneous Preaccumulations with Shared Inputs
by: Blühdorn, Johannes, et al.
Published: (2024)
Hybrid parallel discrete adjoints in SU2
by: Blühdorn, Johannes, et al.
Published: (2024)
Minimum Cost Loop Nests for Contraction of a Sparse Tensor with a Tensor Network
by: Kanakagiri, Raghavendra, et al.
Published: (2023)
Software Development Aspects of Integrating Linear Algebra Libraries
by: Koch, Marcel, et al.
Published: (2025)
Tensor Decompositions for Count Data that Leverage Stochastic and Deterministic Optimization
by: Myers, Jeremy M., et al.
Published: (2022)
Verifying a Sparse Matrix Algorithm Using Symbolic Execution
by: Wilton, Alexander C.
Published: (2025)
Cascaded Prediction and Asynchronous Execution of Iterative Algorithms on Heterogeneous Platforms
by: Gao, Jianhua, et al.
Published: (2024)
Mapping Sparse Triangular Solves to GPUs via Fine-grained Domain Decomposition
by: Gondhalekar, Atharva, et al.
Published: (2025)
Reasoning about expression evaluation under interference
by: Hayes, Ian J., et al.
Published: (2024)
Data reification in a concurrent rely-guarantee algebra
by: Meinicke, Larissa A., et al.
Published: (2024)
Memory-Efficient Training with In-Place FFT Implementation
by: Ding, Xinyu, et al.
Published: (2025)
A Symbolic Computing Perspective on Software Systems
by: Norman, Arthur C., et al.
Published: (2024)
Pyroclast: A Modular High-Performance Python Solver for Geodynamics
by: Ferrari, Marcel
Published: (2026)
Reasoning about concurrent loops and recursion with rely-guarantee rules
by: Hayes, Ian J., et al.
Published: (2025)
BurTorch: Revisiting Training from First Principles by Coupling Autodiff, Math Optimization, and Systems
by: Burlachenko, Konstantin, et al.
Published: (2025)
The ensmallen library for flexible numerical optimization
by: Curtin, Ryan R., et al.
Published: (2021)
Stencil Computations on AMD and Nvidia Graphics Processors: Performance and Tuning Strategies
by: Pekkilä, Johannes, et al.
Published: (2024)
Reduction of the graph isomorphism problem to equality checking of $n$-variables polynomials and the algorithms that use the reduction
by: Prolubnikov, Alexander
Published: (2015)
CUDABeaver: Benchmarking LLM-Based Automated CUDA Debugging
by: Li, Shiyang, et al.
Published: (2026)
Trilinos: Enabling Scientific Computing Across Diverse Hardware Architectures at Scale
by: Mayr, Matthias, et al.
Published: (2025)
Fast GPU Linear Algebra via Compile Time Expression Fusion
by: Curtin, Ryan R., et al.
Published: (2026)
Armadillo: An Efficient Framework for Numerical Linear Algebra
by: Sanderson, Conrad, et al.
Published: (2025)
Randomized Approach to Matrix Completion: Applications in Recommendation Systems and Image Inpainting
by: Krajewska, Antonina, et al.
Published: (2024)
Mathematical modeling of the mechanical behavior of three-layer plates with a tetrachiral honeycomb core
by: Mazaev, A. V.
Published: (2023)
On the relativistic viability of multi-automaton systems: essential concepts, challenges and prospects
by: Băbeanu, Alexandru-Ionuţ
Published: (2024)
Summa Summarum: Moessner's Theorem without Dynamic Programming
by: Danvy, Olivier
Published: (2024)
Optimizing Tensor Train Decomposition in DNNs for RISC-V Architectures Using Design Space Exploration and Compiler Optimizations
by: Anthimopoulos, Theologos, et al.
Published: (2026)
Sampling patterns for Zernike-like bases in non-standard geometries
by: Díaz-Elbal, Sergio, et al.
Published: (2025)
HYLU: Hybrid Parallel Sparse LU Factorization
by: Chen, Xiaoming
Published: (2025)
Optimization under uncertainty: understanding orders and testing programs with specifications
by: Jansson, Patrik, et al.
Published: (2025)
Canonicalization of Batched Einstein Summations for Tuning Retrieval
by: Kulkarni, Kaushik, et al.
Published: (2026)
Problems from Optimization and Computational Algebra Equivalent to Hilbert's Nullstellensatz
by: Bläser, Markus, et al.
Published: (2025)
Flexible Quaternion Generalized Minimal Residual Method for Ill-Posed Quaternion Inverse Problems
by: Liu, Xuan, et al.
Published: (2024)
Reasoning about distributive laws in a concurrent refinement algebra
by: Meinicke, Larissa A., et al.
Published: (2024)
Restructuring a concurrent refinement algebra
by: Hayes, Ian J., et al.
Published: (2024)
Formal Verification of COO to CSR Sparse Matrix Conversion (Invited Paper)
by: Appel, Andrew W.
Published: (2025)
Covariance Matrix Analysis for Optimal Portfolio Selection
by: Keith, Lim Hao Shen
Published: (2024)
Conversational Concurrency
by: Garnock-Jones, Tony
Published: (2024)