:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Minyu, Qin, Song, Wu, Ling-I, Xue, Jianxin, Li, Guoqiang
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.10634
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DaSAThco: Data-Aware SAT Heuristics Combinations Optimization via Large Language Models
by: Chen, Minyu, et al.
Published: (2025)

Can Language Models Pretend Solvers? Logic Code Simulation with LLMs
by: Chen, Minyu, et al.
Published: (2024)

Enhancing Automated Loop Invariant Generation for Complex Programs with Large Language Models
by: Liu, Ruibang, et al.
Published: (2024)

ARCEAK: An Automated Rule Checking Framework Enhanced with Architectural Knowledge
by: Chen, Junyong, et al.
Published: (2024)

Heuristic-Free Multi-Teacher Learning
by: Nguyen, Huy Thong, et al.
Published: (2024)

Heuristic Methods are Good Teachers to Distill MLPs for Graph Link Prediction
by: Qin, Zongyue, et al.
Published: (2025)

Subgoal-Guided Policy Heuristic Search with Learned Subgoals
by: Tuero, Jake, et al.
Published: (2025)

EoH-S: Evolution of Heuristic Set using LLMs for Automated Heuristic Design
by: Liu, Fei, et al.
Published: (2025)

Counteraction-Aware Multi-Teacher On-Policy Distillation for General Capability Recovery with Domain Preservation
by: Chen, Tianlei, et al.
Published: (2026)

Generalizable Heuristic Generation Through LLMs with Meta-Optimization
by: Shi, Yiding, et al.
Published: (2025)

Evolutionary Discovery of Heuristic Policies for Traffic Signal Control
by: Wang, Ruibing, et al.
Published: (2025)

Rethinking LLM-Driven Heuristic Design: Generating Efficient and Specialized Solvers via Dynamics-Aware Optimization
by: Wang, Rongzheng, et al.
Published: (2026)

Game-Theoretic Co-Evolution for LLM-Based Heuristic Discovery
by: Ke, Xinyi, et al.
Published: (2026)

Pathology-Aware Prototype Evolution via LLM-Driven Semantic Disambiguation for Multicenter Diabetic Retinopathy Diagnosis
by: Zhu, Chunzheng, et al.
Published: (2025)

Formalize, Don't Optimize: The Heuristic Trap in LLM-Generated Combinatorial Solvers
by: Wang, Haoyu, et al.
Published: (2026)

An effective Genetic Programming Hyper-Heuristic for Uncertain Agile Satellite Scheduling
by: Chen, Yuning, et al.
Published: (2026)

Boosting Universal LLM Reward Design through Heuristic Reward Observation Space Evolution
by: Heng, Zen Kit, et al.
Published: (2025)

UCPO: Uncertainty-Aware Policy Optimization
by: Zeng, Xianzhou, et al.
Published: (2026)

Fine-tuning Pocket-Aware Diffusion Models via Denoising Policy Optimization
by: Xue, Yuan, et al.
Published: (2026)

Learning Social Heuristics for Human-Aware Path Planning
by: Eirale, Andrea, et al.
Published: (2025)

COPO: Consistency-Aware Policy Optimization
by: Han, Jinghang, et al.
Published: (2025)

Calibration-Aware Policy Optimization for Reasoning LLMs
by: Wang, Ziqi, et al.
Published: (2026)

TIDE: Tuning-Integrated Dynamic Evolution for LLM-Based Automated Heuristic Design
by: Chen, Chentong, et al.
Published: (2026)

Deep Reinforcement Learning Guided Improvement Heuristic for Job Shop Scheduling
by: Zhang, Cong, et al.
Published: (2022)

Tool-Augmented Policy Optimization: Synergizing Reasoning and Adaptive Tool Use with Reinforcement Learning
by: Wu, Wenxun, et al.
Published: (2025)

BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search
by: Liu, Shiyu, et al.
Published: (2026)

Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming
by: Narita, Minori, et al.
Published: (2025)

ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution
by: Ye, Haoran, et al.
Published: (2024)

Multi-objective Evolution of Heuristic Using Large Language Model
by: Yao, Shunyu, et al.
Published: (2024)

Improving Learnt Local MAPF Policies with Heuristic Search
by: Veerapaneni, Rishi, et al.
Published: (2024)

Adversarial Attack-Defense Co-Evolution for LLM Safety Alignment via Tree-Group Dual-Aware Search and Optimization
by: Li, Xurui, et al.
Published: (2025)

ACPO: Adaptive Curriculum Policy Optimization for Aligning Vision-Language Models in Complex Reasoning
by: Wang, Yunhao, et al.
Published: (2025)

Enhancing Q-Learning with Large Language Model Heuristics
by: Wu, Xiefeng
Published: (2024)

Return to Tradition: Learning Reliable Heuristics with Classical Machine Learning
by: Chen, Dillon Z., et al.
Published: (2024)

SEED-GRPO: Semantic Entropy Enhanced GRPO for Uncertainty-Aware Policy Optimization
by: Chen, Minghan, et al.
Published: (2025)

Learning to Explore: Scaling Agentic Reasoning via Exploration-Aware Policy Optimization
by: Hua, Xingyuan, et al.
Published: (2026)

On-Device Diffusion Transformer Policy for Efficient Robot Manipulation
by: Wu, Yiming, et al.
Published: (2025)

FedProxy: Federated Fine-Tuning of LLMs via Proxy SLMs and Heterogeneity-Aware Fusion
by: Fan, Tao, et al.
Published: (2026)

Learning Domain-Independent Heuristics for Grounded and Lifted Planning
by: Chen, Dillon Z., et al.
Published: (2023)

Efficient Policy Learning with Hybrid Evaluation-Based Genetic Programming for Uncertain Agile Earth Observation Satellite Scheduling
by: Xue, Junhua, et al.
Published: (2026)