| Main Authors: | Ozaki, Katsuhisa, Uchino, Yuki, Imamura, Toshiyuki |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.08009 |
Similar Items
Error Analysis of Matrix Multiplication Emulation Using Ozaki-II Scheme
by: Uchino, Yuki, et al.
Published: (2026)
Performance Enhancement of the Ozaki Scheme on Integer Matrix Multiplication Unit
by: Uchino, Yuki, et al.
Published: (2024)
Double-Precision Matrix Multiplication Emulation via Ozaki-II Scheme with FP8 Quantization
by: Uchino, Yuki, et al.
Published: (2026)
High-Performance and Power-Efficient Emulation of Matrix Multiplication using INT8 Matrix Engines
by: Uchino, Yuki, et al.
Published: (2025)
Sparse Iterative Solvers Using High-Precision Arithmetic with Quasi Multi-Word Algorithms
by: Mukunoki, Daichi, et al.
Published: (2025)
DGEMM without FP64 Arithmetic - Using FP64 Emulation and FP8 Tensor Cores with Ozaki Scheme
by: Mukunoki, Daichi
Published: (2025)
Emulation of Complex Matrix Multiplication based on the Chinese Remainder Theorem
by: Uchino, Yuki, et al.
Published: (2025)
A method of using RSVD in residual calculation of LowBit GEMM
by: Gu, Hongyaoxing
Published: (2024)
Cascading GEMM: High Precision from Low Precision
by: Parikh, Devangi N., et al.
Published: (2023)
Faster arbitrary-precision dot product and matrix multiplication
by: Johansson, Fredrik
Published: (2019)
FalconGEMM: Surpassing Hardware Peaks with Lower-Complexity Matrix Multiplication
by: Zhu, Honglin, et al.
Published: (2026)
Communication-Avoiding SpGEMM via Trident Partitioning on Hierarchical GPU Interconnects
by: Bellavita, Julian, et al.
Published: (2026)
Optimistix: modular optimisation in JAX and Equinox
by: Rader, Jason, et al.
Published: (2024)
Some new techniques to use in serial sparse Cholesky factorization algorithms
by: Karsavuran, M. Ozan, et al.
Published: (2024)
A square root algorithm faster than Newton's method for multiprecision numbers, using floating-point arithmetic
by: Romano, Fabio
Published: (2024)
A modular and extensible library for parameterized terrain generation
by: Wallin, Erik
Published: (2025)
Iterative Refinement for a Subset of Eigenvectors of Symmetric Matrices via Matrix Multiplications
by: Terao, Takeshi, et al.
Published: (2026)
Interface for Sparse Linear Algebra Operations
by: Abdelfattah, Ahmad, et al.
Published: (2024)
A new object-oriented framework for solving multiphysics problems via combination of different numerical methods
by: Sargado, Juan Michael
Published: (2019)
MooAFEM: An object oriented Matlab code for higher-order adaptive FEM for (nonlinear) elliptic PDEs
by: Innerberger, Michael, et al.
Published: (2022)
Enhancing non-Perl bioinformatic applications with Perl: Building novel, component based applications using Object Orientation, PDL, Alien, FFI, Inline and OpenMP
by: Argyropoulos, Christos
Published: (2024)
Acceleration of multi-component multiple-precision arithmetic with branch-free algorithms and SIMD vectorization
by: Kouya, Tomonori
Published: (2026)
Fast multiplication by two's complement addition of numbers represented as a set of polynomial radix 2 indexes, stored as an integer list for massively parallel computation
by: Stocks, Mark
Published: (2023)
On the energy efficiency of sparse matrix computations on multi-GPU clusters
by: Bernaschi, Massimo, et al.
Published: (2025)
Towards Richer Challenge Problems for Scientific Computing Correctness
by: Sottile, Matthew, et al.
Published: (2025)
Experience converting a large mathematical software package written in C++ to C++20 modules
by: Bangerth, Wolfgang
Published: (2025)
Extended Abstract: Partial-encapsulate and Its Support for Floating-point Operations in ACL2
by: Kaufmann, Matt, et al.
Published: (2025)
Tensor Evolution: A Framework for Fast Evaluation of Tensor Computations using Recurrences
by: Absar, Javed, et al.
Published: (2025)
evomap: A Toolbox for Dynamic Mapping in Python
by: Matthe, Maximilian
Published: (2025)
QUBOLite: A lightweigth Python toolkit for QUBO
by: Mücke, Sascha, et al.
Published: (2025)
A FAIR File Format for Mathematical Software
by: Della Vecchia, Antony, et al.
Published: (2023)
Predefined Software Environment Runtimes As A Measure For Reproducibility
by: Kaushik, Aaruni
Published: (2024)
Correctly Rounded Functions For Vector Applications: A Performance Study
by: Anderson, Cristina, et al.
Published: (2026)
SySTeC: A Symmetric Sparse Tensor Compiler
by: Patel, Radha, et al.
Published: (2024)
AsaPy: A Python Library for Aerospace Simulation Analysis
by: Dantas, Joao P. A., et al.
Published: (2023)
A comparison of two effective methods for reordering columns within supernodes
by: Karsavuran, M. Ozan, et al.
Published: (2025)
A Matlab code for analysis and topology optimization with Third Medium Contact
by: Frederiksen, Andreas Henrik, et al.
Published: (2025)
Grassland: A Rapid Algebraic Modeling System for Million-variable Optimization
by: Li, Xihan, et al.
Published: (2021)
Analyzing Computational Approaches for Differential Equations: A Study of MATLAB, Mathematica, and Maple
by: Ogethakpo, Arhonefe Joseph, et al.
Published: (2025)
A Novel SIMD-Optimized Implementation for Fast and Memory-Efficient Trigonometric Computation
by: Goyal, Nikhil Dev, et al.
Published: (2025)