| Main Authors: | Qu, Qiuyi, Sui, Yicheng, Sun, Yufei, Chen, Rui, Zhang, Xiaofei, Zhang, Yuzhi, Wang, Haofeng, Lan, Ge |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.12698 |
Similar Items
LLM-Oriented Retrieval Tuner
by: Sun, Si, et al.
Published: (2024)
Semantic Convergence: Harmonizing Recommender Systems via Two-Stage Alignment and Behavioral Semantic Tokenization
by: Li, Guanghan, et al.
Published: (2024)
Transformer Based Linear Attention with Optimized GPU Kernel Implementation
by: Gerami, Armin, et al.
Published: (2025)
Lean Refactor: Multi-Objective Controllable Proof Optimization via Agentic Strategy Search
by: Lu, Jialin, et al.
Published: (2026)
FastKernels: Benchmarking GPU Kernel Generation in Production
by: Oliaro, Gabriele, et al.
Published: (2026)
Insum: Sparse GPU Kernels Simplified and Optimized with Indirect Einsums
by: Won, Jaeyeon, et al.
Published: (2025)
AgentKernelArena: Generalization-Aware Benchmarking of GPU Kernel Optimization Agents
by: Younesian, Sharareh, et al.
Published: (2026)
Advances in Semantic Patching for HPC-oriented Refactorings with Coccinelle
by: Martone, Michele, et al.
Published: (2025)
RPTS: Tree-Structured Reasoning Process Scoring for Faithful Multimodal Evaluation
by: Wang, Haofeng, et al.
Published: (2025)
Skill-as-Pseudocode: Refactoring Skill Libraries to Pseudocode for LLM Agents
by: Li, Xinze, et al.
Published: (2026)
Equivalence Checking of ML GPU Kernels
by: Dubey, Kshitij, et al.
Published: (2025)
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization
by: Du, He, et al.
Published: (2026)
Astra: A Multi-Agent System for GPU Kernel Performance Optimization
by: Wei, Anjiang, et al.
Published: (2025)
AGFT: An Adaptive GPU Frequency Tuner for Real-Time LLM Inference Optimization
by: Ye, Zicong, et al.
Published: (2025)
Code Generation for Cryptographic Kernels using Multi-word Modular Arithmetic on GPU
by: Zhang, Naifeng, et al.
Published: (2025)
SAC-Opt: Semantic Anchors for Iterative Correction in Optimization Modeling
by: Zhang, Yansen, et al.
Published: (2025)
Unlocking Insights: Semantic Search in Jupyter Notebooks
by: Li, Lan, et al.
Published: (2024)
Optimizing Datalog for the GPU
by: Sun, Yihao, et al.
Published: (2023)
KPerfIR: Towards an Open and Compiler-centric Ecosystem for GPU Kernel Performance Tooling on Modern AI Workloads
by: Guan, Yue, et al.
Published: (2025)
ACR: Adaptive Context Refactoring via Context Refactoring Operators for Multi-Turn Dialogue
by: Shen, Jiawei, et al.
Published: (2026)
Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search
by: Li, Chenglin, et al.
Published: (2024)
Learning How and What to Memorize: Cognition-Inspired Two-Stage Optimization for Evolving Memory
by: Xu, Derong, et al.
Published: (2026)
Leveraging LLMs to Automate Energy-Aware Refactoring of Parallel Scientific Codes
by: Dearing, Matthew T., et al.
Published: (2025)
Towards Better Multi-task Learning: A Framework for Optimizing Dataset Combinations in Large Language Models
by: Zhan, Zaifu, et al.
Published: (2024)
ChatCFD: An LLM-Driven Agent for End-to-End CFD Automation with Structured Knowledge and Reasoning
by: Fan, E, et al.
Published: (2025)
ConfTuner: Training Large Language Models to Express Their Confidence Verbally
by: Li, Yibo, et al.
Published: (2025)
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2025)
QiMeng-Kernel: Macro-Thinking Micro-Coding Paradigm for LLM-Based High-Performance GPU Kernel Generation
by: Zhu, Xinguo, et al.
Published: (2025)
Mamba for Streaming ASR Combined with Unimodal Aggregation
by: Fang, Ying, et al.
Published: (2024)
Semantic Search Evaluation
by: Zheng, Chujie, et al.
Published: (2024)
Two-Stage Regularization-Based Structured Pruning for LLMs
by: Feng, Mingkuan, et al.
Published: (2025)
Comateformer: Combined Attention Transformer for Semantic Sentence Matching
by: Li, Bo, et al.
Published: (2024)
SeCon-RAG: A Two-Stage Semantic Filtering and Conflict-Free Framework for Trustworthy RAG
by: Si, Xiaonan, et al.
Published: (2025)
ReGAL: Refactoring Programs to Discover Generalizable Abstractions
by: Stengel-Eskin, Elias, et al.
Published: (2024)
PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words
by: Liang, Yuzhi, et al.
Published: (2026)
Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study Over Open-ended Question Answering
by: Sui, Yuan, et al.
Published: (2024)
A Two-Stage Multimodal Emotion Recognition Model Based on Graph Contrastive Learning
by: Ai, Wei, et al.
Published: (2024)
Plug-and-Play Training Framework for Preference Optimization
by: Ma, Jingyuan, et al.
Published: (2024)
Nautilus: An Auto-Scheduling Tensor Compiler for Efficient Tiled GPU Kernels
by: Zhao, Yifan, et al.
Published: (2026)
Dynamic Optimizations of LLM Ensembles with Two-Stage Reinforcement Learning Agents
by: Tekin, Selim Furkan, et al.
Published: (2025)