| Main Author: | Veit, Cooper |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.22032 |
Similar Items
Mirage Persistent Kernel: A Compiler and Runtime for Mega-Kernelizing Tensor Programs
by: Cheng, Xinhao, et al.
Published: (2025)
Nautilus: An Auto-Scheduling Tensor Compiler for Efficient Tiled GPU Kernels
by: Zhao, Yifan, et al.
Published: (2026)
Flex Attention: A Programming Model for Generating Optimized Attention Kernels
by: Dong, Juechu, et al.
Published: (2024)
Equivalence Checking of ML GPU Kernels
by: Dubey, Kshitij, et al.
Published: (2025)
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization
by: Du, He, et al.
Published: (2026)
Small Language Models as Compiler Experts: Auto-Parallelization for Heterogeneous Systems
by: Devadiga, Prathamesh
Published: (2025)
Neptune: Advanced ML Operator Fusion for Locality and Parallelism on GPUs
by: Zhao, Yifan, et al.
Published: (2025)
The Next 700 ML-Enabled Compiler Optimizations
by: VenkataKeerthy, S., et al.
Published: (2023)
Learning Minimal Neural Specifications
by: Geng, Chuqin, et al.
Published: (2024)
Kernelized Concept Erasure
by: Ravfogel, Shauli, et al.
Published: (2022)
FastKernels: Benchmarking GPU Kernel Generation in Production
by: Oliaro, Gabriele, et al.
Published: (2026)
The Anatomy of a Triton Attention Kernel
by: Ringlein, Burkhard, et al.
Published: (2025)
Detecting Buggy Contracts via Smart Testing
by: Wang, Sally Junsong, et al.
Published: (2024)
A Domain-Specific Language for LLM-Driven Trigger Generation in Multimodal Data Collection
by: Reis, Philipp, et al.
Published: (2026)
SwiftEval: Developing a Language-Specific Benchmark for LLM-generated Code Evaluation
by: Petrukha, Ivan, et al.
Published: (2025)
SmartEval: A Benchmark for Evaluating LLM-Generated Smart Contracts from Natural Language Specifications
by: Goel, Abhinav, et al.
Published: (2026)
Explore as a Storm, Exploit as a Raindrop: On the Benefit of Fine-Tuning Kernel Schedulers with Coordinate Descent
by: Canesche, Michael, et al.
Published: (2024)
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
by: Liu, Wei, et al.
Published: (2026)
DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
by: Bai, Haolei, et al.
Published: (2026)
DiJiang: Efficient Large Language Models through Compact Kernelization
by: Chen, Hanting, et al.
Published: (2024)
More Than a Score: Probing the Impact of Prompt Specificity on LLM Code Generation
by: Zi, Yangtian, et al.
Published: (2025)
AgentKernelArena: Generalization-Aware Benchmarking of GPU Kernel Optimization Agents
by: Younesian, Sharareh, et al.
Published: (2026)
A Deep Dive into Function Inlining and its Security Implications for ML-based Binary Analysis
by: Abusabha, Omar, et al.
Published: (2025)
Towards Automated Kernel Generation in the Era of LLMs
by: Yu, Yang, et al.
Published: (2026)
Cubit: Token Mixer with Kernel Ridge Regression
by: Zheng, Chuanyang, et al.
Published: (2026)
Liger Kernel: Efficient Triton Kernels for LLM Training
by: Hsu, Pin-Lun, et al.
Published: (2024)
BODHI: Precise OS Kernel Specification Inference
by: Chang, Zhiming, et al.
Published: (2026)
Verification Modulo Tested Library Contracts
by: Uppar, Abhishek, et al.
Published: (2026)
MonoCoder: Domain-Specific Code Language Model for HPC Codes and Tasks
by: Kadosh, Tal, et al.
Published: (2023)
Grounding Data Science Code Generation with Input-Output Specifications
by: Wen, Yeming, et al.
Published: (2024)
DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization
by: Das, Amitava, et al.
Published: (2025)
The Collapse of Heterogeneity in Silicon Philosophers
by: Shi, Yuanming, et al.
Published: (2026)
Anka: A Domain-Specific Language for Reliable LLM Code Generation
by: Mazrouei, Saif Khalfan Saif Al
Published: (2025)
Transformer Based Linear Attention with Optimized GPU Kernel Implementation
by: Gerami, Armin, et al.
Published: (2025)
Linear Transformers with Learnable Kernel Functions are Better In-Context Models
by: Aksenov, Yaroslav, et al.
Published: (2024)
Correctness-Guaranteed Code Generation via Constrained Decoding
by: Li, Lingxiao, et al.
Published: (2025)
Type-Constrained Code Generation with Language Models
by: Mündler, Niels, et al.
Published: (2025)
Compiler generated feedback for Large Language Models
by: Grubisic, Dejan, et al.
Published: (2024)
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities
by: Nikitin, Alexander, et al.
Published: (2024)
Automated Type Annotation in Python Using Large Language Models
by: Bharti, Varun, et al.
Published: (2025)