Saved in:
| Main Authors: | Zhang, Yihong, Gerstmann, Derek, Adams, Andrew, Ahmad, Maaz Bin Safeer |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.02371 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Scalable MatMul-free Language Modeling
by: Zhu, Rui-Jie, et al.
Published: (2024)
by: Zhu, Rui-Jie, et al.
Published: (2024)
Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities
by: Cavagna, Hiari Pizzini, et al.
Published: (2025)
by: Cavagna, Hiari Pizzini, et al.
Published: (2025)
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs
by: Ailon, Nir, et al.
Published: (2025)
by: Ailon, Nir, et al.
Published: (2025)
Chameleon: A MatMul-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Data
by: Blanken, Douwe den, et al.
Published: (2025)
by: Blanken, Douwe den, et al.
Published: (2025)
A Solver-Aided Hierarchical Language for LLM-Driven CAD Design
by: Jones, Benjamin T., et al.
Published: (2025)
by: Jones, Benjamin T., et al.
Published: (2025)
Nautilus: An Auto-Scheduling Tensor Compiler for Efficient Tiled GPU Kernels
by: Zhao, Yifan, et al.
Published: (2026)
by: Zhao, Yifan, et al.
Published: (2026)
SparseAuto: An Auto-Scheduler for Sparse Tensor Computations Using Recursive Loop Nest Restructuring
by: Dias, Adhitha, et al.
Published: (2023)
by: Dias, Adhitha, et al.
Published: (2023)
Exo 2: Growing a Scheduling Language
by: Ikarashi, Yuka, et al.
Published: (2024)
by: Ikarashi, Yuka, et al.
Published: (2024)
Domain-Specific Tensor Languages
by: Bernardy, Jean-Philippe, et al.
Published: (2023)
by: Bernardy, Jean-Philippe, et al.
Published: (2023)
LLM-Aided Compilation for Tensor Accelerators
by: Hong, Charles, et al.
Published: (2024)
by: Hong, Charles, et al.
Published: (2024)
Mat2Boundary: Treating User-Defined Boundary Condition as SpMV for Distributed PDE Solvers on Block-Structured Grids
by: Cai, Yanzheng, et al.
Published: (2026)
by: Cai, Yanzheng, et al.
Published: (2026)
How Programming Concepts and Neurons Are Shared in Code Language Models
by: Kargaran, Amir Hossein, et al.
Published: (2025)
by: Kargaran, Amir Hossein, et al.
Published: (2025)
Effects and Coeffects in Call-By-Push-Value (Extended Version)
by: Torczon, Cassia, et al.
Published: (2023)
by: Torczon, Cassia, et al.
Published: (2023)
Rewrite System Showdown: Stochastic Search vs. EqSat
by: Hong, Qiantan, et al.
Published: (2026)
by: Hong, Qiantan, et al.
Published: (2026)
Semantic foundations of equality saturation
by: Suciu, Dan, et al.
Published: (2025)
by: Suciu, Dan, et al.
Published: (2025)
Verifying Correctness of Shared Channels in a Cooperatively Scheduled Process-Oriented Language
by: Pedersen, Jan, et al.
Published: (2025)
by: Pedersen, Jan, et al.
Published: (2025)
ACT: Automatically Generating Compiler Backends from Tensor Accelerator ISA Descriptions
by: Jain, Devansh, et al.
Published: (2025)
by: Jain, Devansh, et al.
Published: (2025)
CLMTracing: Black-box User-level Watermarking for Code Language Model Tracing
by: Zhang, Boyu, et al.
Published: (2025)
by: Zhang, Boyu, et al.
Published: (2025)
Abstract Operational Methods for Call-by-Push-Value
by: Goncharov, Sergey, et al.
Published: (2024)
by: Goncharov, Sergey, et al.
Published: (2024)
Autocomp: A Powerful and Portable Code Optimizer for Tensor Accelerators
by: Hong, Charles, et al.
Published: (2025)
by: Hong, Charles, et al.
Published: (2025)
LLMigrate: Transforming "Lazy" Large Language Models into Efficient Source Code Migrators
by: Liu, Yuchen, et al.
Published: (2025)
by: Liu, Yuchen, et al.
Published: (2025)
Expression Acceleration: Seamless Parallelization of Typed High-Level Languages
by: Hummelgren, Lars, et al.
Published: (2022)
by: Hummelgren, Lars, et al.
Published: (2022)
HaliVer: Deductive Verification and Scheduling Languages Join Forces
by: Haak, Lars B. van den, et al.
Published: (2024)
by: Haak, Lars B. van den, et al.
Published: (2024)
Monk: Opportunistic Scheduling to Delay Horizontal Scaling
by: Shimchenko, Marina, et al.
Published: (2025)
by: Shimchenko, Marina, et al.
Published: (2025)
Task-Based Tensor Computations on Modern GPUs
by: Yadav, Rohan, et al.
Published: (2025)
by: Yadav, Rohan, et al.
Published: (2025)
The Continuous Tensor Abstraction: Where Indices are Real
by: Won, Jaeyeon, et al.
Published: (2024)
by: Won, Jaeyeon, et al.
Published: (2024)
Tensor Evolution: A Framework for Fast Evaluation of Tensor Computations using Recurrences
by: Absar, Javed, et al.
Published: (2025)
by: Absar, Javed, et al.
Published: (2025)
Scheduling Languages: A Past, Present, and Future Taxonomy
by: Hall, Mary, et al.
Published: (2024)
by: Hall, Mary, et al.
Published: (2024)
TENSURE: Fuzzing Sparse Tensor Compilers (Registered Report)
by: Mahathevan, Kabilan, et al.
Published: (2026)
by: Mahathevan, Kabilan, et al.
Published: (2026)
SkyEgg: Joint Implementation Selection and Scheduling for Hardware Synthesis using E-graphs
by: Xiao, Youwei, et al.
Published: (2025)
by: Xiao, Youwei, et al.
Published: (2025)
Scheduling Garbage Collection for Energy Efficiency on Asymmetric Multicore Processors
by: Shimchenko, Marina, et al.
Published: (2024)
by: Shimchenko, Marina, et al.
Published: (2024)
Cedar: A New Language for Expressive, Fast, Safe, and Analyzable Authorization (Extended Version)
by: Cutler, Joseph W., et al.
Published: (2024)
by: Cutler, Joseph W., et al.
Published: (2024)
GPU-Accelerated Synthesis of Mixed-Boolean Arithmetic: Beyond Caching
by: Bathie, Gabriel, et al.
Published: (2026)
by: Bathie, Gabriel, et al.
Published: (2026)
DITRON: Distributed Multi-level Tiling Compiler for Parallel Tensor Programs
by: Zheng, Size, et al.
Published: (2026)
by: Zheng, Size, et al.
Published: (2026)
Galley: Modern Query Optimization for Sparse Tensor Programs
by: Deeds, Kyle, et al.
Published: (2024)
by: Deeds, Kyle, et al.
Published: (2024)
Qwerty: A Basis-Oriented Quantum Programming Language
by: Adams, Austin J., et al.
Published: (2024)
by: Adams, Austin J., et al.
Published: (2024)
Tenspiler: A Verified Lifting-Based Compiler for Tensor Operations (Extended Version)
by: Qiu, Jie, et al.
Published: (2024)
by: Qiu, Jie, et al.
Published: (2024)
QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach
by: Dong, Shouyang, et al.
Published: (2025)
by: Dong, Shouyang, et al.
Published: (2025)
Disassembly as Weighted Interval Scheduling with Learned Weights
by: Flores-Montoya, Antonio, et al.
Published: (2025)
by: Flores-Montoya, Antonio, et al.
Published: (2025)
Reactive Semantics for User Interface Description Languages
by: Pesin, Basile, et al.
Published: (2025)
by: Pesin, Basile, et al.
Published: (2025)
Similar Items
-
Scalable MatMul-free Language Modeling
by: Zhu, Rui-Jie, et al.
Published: (2024) -
Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities
by: Cavagna, Hiari Pizzini, et al.
Published: (2025) -
Changing Base Without Losing Pace: A GPU-Efficient Alternative to MatMul in DNNs
by: Ailon, Nir, et al.
Published: (2025) -
Chameleon: A MatMul-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Data
by: Blanken, Douwe den, et al.
Published: (2025) -
A Solver-Aided Hierarchical Language for LLM-Driven CAD Design
by: Jones, Benjamin T., et al.
Published: (2025)