Saved in:
| Main Authors: | Chen, Minyu, Qin, Song, Wu, Ling-I, Xue, Jianxin, Li, Guoqiang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.10634 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DaSAThco: Data-Aware SAT Heuristics Combinations Optimization via Large Language Models
by: Chen, Minyu, et al.
Published: (2025)
by: Chen, Minyu, et al.
Published: (2025)
Can Language Models Pretend Solvers? Logic Code Simulation with LLMs
by: Chen, Minyu, et al.
Published: (2024)
by: Chen, Minyu, et al.
Published: (2024)
Enhancing Automated Loop Invariant Generation for Complex Programs with Large Language Models
by: Liu, Ruibang, et al.
Published: (2024)
by: Liu, Ruibang, et al.
Published: (2024)
ARCEAK: An Automated Rule Checking Framework Enhanced with Architectural Knowledge
by: Chen, Junyong, et al.
Published: (2024)
by: Chen, Junyong, et al.
Published: (2024)
Heuristic-Free Multi-Teacher Learning
by: Nguyen, Huy Thong, et al.
Published: (2024)
by: Nguyen, Huy Thong, et al.
Published: (2024)
Heuristic Methods are Good Teachers to Distill MLPs for Graph Link Prediction
by: Qin, Zongyue, et al.
Published: (2025)
by: Qin, Zongyue, et al.
Published: (2025)
Subgoal-Guided Policy Heuristic Search with Learned Subgoals
by: Tuero, Jake, et al.
Published: (2025)
by: Tuero, Jake, et al.
Published: (2025)
EoH-S: Evolution of Heuristic Set using LLMs for Automated Heuristic Design
by: Liu, Fei, et al.
Published: (2025)
by: Liu, Fei, et al.
Published: (2025)
Counteraction-Aware Multi-Teacher On-Policy Distillation for General Capability Recovery with Domain Preservation
by: Chen, Tianlei, et al.
Published: (2026)
by: Chen, Tianlei, et al.
Published: (2026)
Generalizable Heuristic Generation Through LLMs with Meta-Optimization
by: Shi, Yiding, et al.
Published: (2025)
by: Shi, Yiding, et al.
Published: (2025)
Evolutionary Discovery of Heuristic Policies for Traffic Signal Control
by: Wang, Ruibing, et al.
Published: (2025)
by: Wang, Ruibing, et al.
Published: (2025)
Rethinking LLM-Driven Heuristic Design: Generating Efficient and Specialized Solvers via Dynamics-Aware Optimization
by: Wang, Rongzheng, et al.
Published: (2026)
by: Wang, Rongzheng, et al.
Published: (2026)
Game-Theoretic Co-Evolution for LLM-Based Heuristic Discovery
by: Ke, Xinyi, et al.
Published: (2026)
by: Ke, Xinyi, et al.
Published: (2026)
Pathology-Aware Prototype Evolution via LLM-Driven Semantic Disambiguation for Multicenter Diabetic Retinopathy Diagnosis
by: Zhu, Chunzheng, et al.
Published: (2025)
by: Zhu, Chunzheng, et al.
Published: (2025)
Formalize, Don't Optimize: The Heuristic Trap in LLM-Generated Combinatorial Solvers
by: Wang, Haoyu, et al.
Published: (2026)
by: Wang, Haoyu, et al.
Published: (2026)
An effective Genetic Programming Hyper-Heuristic for Uncertain Agile Satellite Scheduling
by: Chen, Yuning, et al.
Published: (2026)
by: Chen, Yuning, et al.
Published: (2026)
Boosting Universal LLM Reward Design through Heuristic Reward Observation Space Evolution
by: Heng, Zen Kit, et al.
Published: (2025)
by: Heng, Zen Kit, et al.
Published: (2025)
UCPO: Uncertainty-Aware Policy Optimization
by: Zeng, Xianzhou, et al.
Published: (2026)
by: Zeng, Xianzhou, et al.
Published: (2026)
Fine-tuning Pocket-Aware Diffusion Models via Denoising Policy Optimization
by: Xue, Yuan, et al.
Published: (2026)
by: Xue, Yuan, et al.
Published: (2026)
Learning Social Heuristics for Human-Aware Path Planning
by: Eirale, Andrea, et al.
Published: (2025)
by: Eirale, Andrea, et al.
Published: (2025)
COPO: Consistency-Aware Policy Optimization
by: Han, Jinghang, et al.
Published: (2025)
by: Han, Jinghang, et al.
Published: (2025)
Calibration-Aware Policy Optimization for Reasoning LLMs
by: Wang, Ziqi, et al.
Published: (2026)
by: Wang, Ziqi, et al.
Published: (2026)
TIDE: Tuning-Integrated Dynamic Evolution for LLM-Based Automated Heuristic Design
by: Chen, Chentong, et al.
Published: (2026)
by: Chen, Chentong, et al.
Published: (2026)
Deep Reinforcement Learning Guided Improvement Heuristic for Job Shop Scheduling
by: Zhang, Cong, et al.
Published: (2022)
by: Zhang, Cong, et al.
Published: (2022)
Tool-Augmented Policy Optimization: Synergizing Reasoning and Adaptive Tool Use with Reinforcement Learning
by: Wu, Wenxun, et al.
Published: (2025)
by: Wu, Wenxun, et al.
Published: (2025)
BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search
by: Liu, Shiyu, et al.
Published: (2026)
by: Liu, Shiyu, et al.
Published: (2026)
Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
by: Narita, Minori, et al.
Published: (2025)
by: Narita, Minori, et al.
Published: (2025)
ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution
by: Ye, Haoran, et al.
Published: (2024)
by: Ye, Haoran, et al.
Published: (2024)
Multi-objective Evolution of Heuristic Using Large Language Model
by: Yao, Shunyu, et al.
Published: (2024)
by: Yao, Shunyu, et al.
Published: (2024)
Improving Learnt Local MAPF Policies with Heuristic Search
by: Veerapaneni, Rishi, et al.
Published: (2024)
by: Veerapaneni, Rishi, et al.
Published: (2024)
Adversarial Attack-Defense Co-Evolution for LLM Safety Alignment via Tree-Group Dual-Aware Search and Optimization
by: Li, Xurui, et al.
Published: (2025)
by: Li, Xurui, et al.
Published: (2025)
ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
by: Wang, Yunhao, et al.
Published: (2025)
by: Wang, Yunhao, et al.
Published: (2025)
Enhancing Q-Learning with Large Language Model Heuristics
by: Wu, Xiefeng
Published: (2024)
by: Wu, Xiefeng
Published: (2024)
Return to Tradition: Learning Reliable Heuristics with Classical Machine Learning
by: Chen, Dillon Z., et al.
Published: (2024)
by: Chen, Dillon Z., et al.
Published: (2024)
SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
by: Chen, Minghan, et al.
Published: (2025)
by: Chen, Minghan, et al.
Published: (2025)
Learning to Explore: Scaling Agentic Reasoning via Exploration-Aware Policy Optimization
by: Hua, Xingyuan, et al.
Published: (2026)
by: Hua, Xingyuan, et al.
Published: (2026)
On-Device Diffusion Transformer Policy for Efficient Robot Manipulation
by: Wu, Yiming, et al.
Published: (2025)
by: Wu, Yiming, et al.
Published: (2025)
FedProxy: Federated Fine-Tuning of LLMs via Proxy SLMs and Heterogeneity-Aware Fusion
by: Fan, Tao, et al.
Published: (2026)
by: Fan, Tao, et al.
Published: (2026)
Learning Domain-Independent Heuristics for Grounded and Lifted Planning
by: Chen, Dillon Z., et al.
Published: (2023)
by: Chen, Dillon Z., et al.
Published: (2023)
Efficient Policy Learning with Hybrid Evaluation-Based Genetic Programming for Uncertain Agile Earth Observation Satellite Scheduling
by: Xue, Junhua, et al.
Published: (2026)
by: Xue, Junhua, et al.
Published: (2026)
Similar Items
-
DaSAThco: Data-Aware SAT Heuristics Combinations Optimization via Large Language Models
by: Chen, Minyu, et al.
Published: (2025) -
Can Language Models Pretend Solvers? Logic Code Simulation with LLMs
by: Chen, Minyu, et al.
Published: (2024) -
Enhancing Automated Loop Invariant Generation for Complex Programs with Large Language Models
by: Liu, Ruibang, et al.
Published: (2024) -
ARCEAK: An Automated Rule Checking Framework Enhanced with Architectural Knowledge
by: Chen, Junyong, et al.
Published: (2024) -
Heuristic-Free Multi-Teacher Learning
by: Nguyen, Huy Thong, et al.
Published: (2024)