Saved in:
| Main Authors: | Roupassov-Ruiz, Anton, Zuo, Yiyang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.04365 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning
by: Kohler, Hector, et al.
Published: (2024)
by: Kohler, Hector, et al.
Published: (2024)
Multimodal LLM-assisted Evolutionary Search for Programmatic Control Policies
by: Hu, Qinglong, et al.
Published: (2025)
by: Hu, Qinglong, et al.
Published: (2025)
Reclaiming the Source of Programmatic Policies: Programmatic versus Latent Spaces
by: Carvalho, Tales H., et al.
Published: (2024)
by: Carvalho, Tales H., et al.
Published: (2024)
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
by: Liu, Max, et al.
Published: (2024)
by: Liu, Max, et al.
Published: (2024)
Programmatic Reinforcement Learning: Navigating Gridworlds
by: Shabadi, Guruprerana, et al.
Published: (2024)
by: Shabadi, Guruprerana, et al.
Published: (2024)
Scheduling That Speaks: An Interpretable Programmatic Reinforcement Learning Framework
by: Hu, Chengpeng, et al.
Published: (2026)
by: Hu, Chengpeng, et al.
Published: (2026)
Common Benchmarks Undervalue the Generalization Power of Programmatic Policies
by: Rajabpour, Amirhossein, et al.
Published: (2025)
by: Rajabpour, Amirhossein, et al.
Published: (2025)
Searching for Programmatic Policies in Semantic Spaces
by: Moraes, Rubens O., et al.
Published: (2024)
by: Moraes, Rubens O., et al.
Published: (2024)
Programmatic Representation Learning with Language Models
by: Poesia, Gabriel, et al.
Published: (2025)
by: Poesia, Gabriel, et al.
Published: (2025)
DiPRL: Learning Discrete Programmatic Policies via Architecture Entropy Regularization
by: Hu, Chengpeng, et al.
Published: (2026)
by: Hu, Chengpeng, et al.
Published: (2026)
Survival of the Fittest: Evolutionary Adaptation of Policies for Environmental Shifts
by: Paul, Sheryl, et al.
Published: (2024)
by: Paul, Sheryl, et al.
Published: (2024)
InnateCoder: Learning Programmatic Options with Foundation Models
by: Moraes, Rubens O., et al.
Published: (2025)
by: Moraes, Rubens O., et al.
Published: (2025)
PolicyEvolve: Evolving Programmatic Policies by LLMs for multi-player games via Population-Based Training
by: Lv, Mingrui, et al.
Published: (2025)
by: Lv, Mingrui, et al.
Published: (2025)
Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network
by: Li, Bingdong, et al.
Published: (2025)
by: Li, Bingdong, et al.
Published: (2025)
Neural Topic Models with Survival Supervision: Jointly Predicting Time-to-Event Outcomes and Learning How Clinical Features Relate
by: Chen, George H., et al.
Published: (2020)
by: Chen, George H., et al.
Published: (2020)
Ranking Joint Policies in Dynamic Games using Evolutionary Dynamics
by: Koliou, Natalia, et al.
Published: (2025)
by: Koliou, Natalia, et al.
Published: (2025)
Reinforcement Learning-based Knowledge Distillation with LLM-as-a-Judge
by: Shen, Yiyang, et al.
Published: (2026)
by: Shen, Yiyang, et al.
Published: (2026)
Evolutionary System Prompt Learning for Reinforcement Learning in LLMs
by: Zhang, Lunjun, et al.
Published: (2026)
by: Zhang, Lunjun, et al.
Published: (2026)
CASSANDRA: Programmatic and Probabilistic Learning and Inference for Stochastic World Modeling
by: Lymperopoulos, Panagiotis, et al.
Published: (2026)
by: Lymperopoulos, Panagiotis, et al.
Published: (2026)
Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks
by: Dereventsov, Anton, et al.
Published: (2022)
by: Dereventsov, Anton, et al.
Published: (2022)
Evolutionary Policy Optimization
by: Wang, Jianren, et al.
Published: (2025)
by: Wang, Jianren, et al.
Published: (2025)
Evolutionary Policy Optimization
by: Mustafaoglu, Zelal Su "Lain", et al.
Published: (2025)
by: Mustafaoglu, Zelal Su "Lain", et al.
Published: (2025)
SVL: Goal-Conditioned Reinforcement Learning as Survival Learning
by: Tiofack, Franki Nguimatsia, et al.
Published: (2026)
by: Tiofack, Franki Nguimatsia, et al.
Published: (2026)
Neural Co-state Policies: Structuring Hidden States in Recurrent Reinforcement Learning
by: Leeftink, David, et al.
Published: (2026)
by: Leeftink, David, et al.
Published: (2026)
Multi-Objective Neural Architecture Search by Learning Search Space Partitions
by: Zhao, Yiyang, et al.
Published: (2024)
by: Zhao, Yiyang, et al.
Published: (2024)
On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling
by: Corrado, Nicholas E., et al.
Published: (2023)
by: Corrado, Nicholas E., et al.
Published: (2023)
LLM as an Algorithmist: Enhancing Anomaly Detectors via Programmatic Synthesis
by: Ye, Hangting, et al.
Published: (2025)
by: Ye, Hangting, et al.
Published: (2025)
Reliable Programmatic Weak Supervision with Confidence Intervals for Label Probabilities
by: Álvarez, Verónica, et al.
Published: (2025)
by: Álvarez, Verónica, et al.
Published: (2025)
Policy Improvement Reinforcement Learning
by: Wang, Huaiyang, et al.
Published: (2026)
by: Wang, Huaiyang, et al.
Published: (2026)
Survival Reinforcement Learning: Toward Scalable Self-Supervised RL
by: Nguimatsia-Tiofack, Franki, et al.
Published: (2026)
by: Nguimatsia-Tiofack, Franki, et al.
Published: (2026)
Helix: Evolutionary Reinforcement Learning for Open-Ended Scientific Problem Solving
by: Su, Chang, et al.
Published: (2026)
by: Su, Chang, et al.
Published: (2026)
Guiding Evolutionary Molecular Design: Adding Reinforcement Learning for Mutation Selection
by: Milon-Harnois, Gaelle, et al.
Published: (2025)
by: Milon-Harnois, Gaelle, et al.
Published: (2025)
Evolutionary Dynamic Optimization and Machine Learning
by: Boulesnane, Abdennour
Published: (2023)
by: Boulesnane, Abdennour
Published: (2023)
Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
by: Shi, Taiwei, et al.
Published: (2025)
by: Shi, Taiwei, et al.
Published: (2025)
Interpretable Non-linear Survival Analysis with Evolutionary Symbolic Regression
by: Rovito, Luigi, et al.
Published: (2025)
by: Rovito, Luigi, et al.
Published: (2025)
Sketch-Plan-Generalize: Learning and Planning with Neuro-Symbolic Programmatic Representations for Inductive Spatial Concepts
by: Kalithasan, Namasivayam, et al.
Published: (2024)
by: Kalithasan, Namasivayam, et al.
Published: (2024)
SPELL: Synthesis of Programmatic Edits using LLMs
by: Ramos, Daniel, et al.
Published: (2026)
by: Ramos, Daniel, et al.
Published: (2026)
Single- vs. Dual-Policy Reinforcement Learning for Dynamic Bike Rebalancing
by: Liang, Jiaqi, et al.
Published: (2024)
by: Liang, Jiaqi, et al.
Published: (2024)
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning
by: Xue, Zhenghai, et al.
Published: (2025)
by: Xue, Zhenghai, et al.
Published: (2025)
Reinforcement Learning for Flow-Matching Policies
by: Pfrommer, Samuel, et al.
Published: (2025)
by: Pfrommer, Samuel, et al.
Published: (2025)
Similar Items
-
Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning
by: Kohler, Hector, et al.
Published: (2024) -
Multimodal LLM-assisted Evolutionary Search for Programmatic Control Policies
by: Hu, Qinglong, et al.
Published: (2025) -
Reclaiming the Source of Programmatic Policies: Programmatic versus Latent Spaces
by: Carvalho, Tales H., et al.
Published: (2024) -
Synthesizing Programmatic Reinforcement Learning Policies with Large Language Model Guided Search
by: Liu, Max, et al.
Published: (2024) -
Programmatic Reinforcement Learning: Navigating Gridworlds
by: Shabadi, Guruprerana, et al.
Published: (2024)