Saved in:
| Main Authors: | Su, Zelal, Mustafaoglu, Lee, Sungyoung, Balachandar, Eshan, Miikkulainen, Risto, Pingali, Keshav |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.12596 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Evolutionary Policy Optimization
by: Mustafaoglu, Zelal Su "Lain", et al.
Published: (2025)
by: Mustafaoglu, Zelal Su "Lain", et al.
Published: (2025)
Towards Efficient and Expressive Offline RL via Flow-Anchored Noise-conditioned Q-Learning
by: Lee, Sungyoung, et al.
Published: (2026)
by: Lee, Sungyoung, et al.
Published: (2026)
Optimizing the Design of an Artificial Pancreas to Improve Diabetes Management
by: Khanna, Ashok, et al.
Published: (2024)
by: Khanna, Ashok, et al.
Published: (2024)
Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost
by: Xu, Yinggan, et al.
Published: (2026)
by: Xu, Yinggan, et al.
Published: (2026)
Flashlight: PyTorch Compiler Extensions to Accelerate Attention Variants
by: You, Bozhi, et al.
Published: (2025)
by: You, Bozhi, et al.
Published: (2025)
The Depth Delusion: Why Transformers Should Be Wider, Not Deeper
by: Fahim, Md Muhtasim Munif, et al.
Published: (2026)
by: Fahim, Md Muhtasim Munif, et al.
Published: (2026)
The Odyssey of the Fittest: Can Agents Survive and Still Be Good?
by: Waldner, Dylan, et al.
Published: (2025)
by: Waldner, Dylan, et al.
Published: (2025)
EVOTER: Evolution of Transparent Explainable Rule-sets
by: Shahrzad, Hormoz, et al.
Published: (2022)
by: Shahrzad, Hormoz, et al.
Published: (2022)
Overcoming Forgetting in LLM Fine-Tuning with Evolution Strategies
by: Schweighofer, Kajetan, et al.
Published: (2026)
by: Schweighofer, Kajetan, et al.
Published: (2026)
Efficient Pre-Training of LLMs through Truncated SVD Layers
by: Kamali, Kaivan, et al.
Published: (2026)
by: Kamali, Kaivan, et al.
Published: (2026)
Discovering Effective Policies for Land-Use Planning with Neuroevolution
by: Young, Daniel, et al.
Published: (2023)
by: Young, Daniel, et al.
Published: (2023)
NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks
by: Wang, Wenxi, et al.
Published: (2021)
by: Wang, Wenxi, et al.
Published: (2021)
The Blessing of Dimensionality in LLM Fine-tuning: A Variance-Curvature Perspective
by: Liang, Qiyao, et al.
Published: (2026)
by: Liang, Qiyao, et al.
Published: (2026)
HAEPO: History-Aggregated Exploratory Policy Optimization
by: Trivedi, Gaurish, et al.
Published: (2025)
by: Trivedi, Gaurish, et al.
Published: (2025)
Fast Think-on-Graph: Wider, Deeper and Faster Reasoning of Large Language Model on Knowledge Graph
by: Liang, Xujian, et al.
Published: (2025)
by: Liang, Xujian, et al.
Published: (2025)
Evolutionary Policy Optimization
by: Wang, Jianren, et al.
Published: (2025)
by: Wang, Jianren, et al.
Published: (2025)
SplAgger: Split Aggregation for Meta-Reinforcement Learning
by: Beck, Jacob, et al.
Published: (2024)
by: Beck, Jacob, et al.
Published: (2024)
Trust-Region Adaptive Policy Optimization
by: Su, Mingyu, et al.
Published: (2025)
by: Su, Mingyu, et al.
Published: (2025)
Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space
by: Qiu, Xin, et al.
Published: (2024)
by: Qiu, Xin, et al.
Published: (2024)
On-Policy Optimization of ANFIS Policies Using Proximal Policy Optimization
by: Shankar, Kaaustaaub, et al.
Published: (2025)
by: Shankar, Kaaustaaub, et al.
Published: (2025)
Neural Cellular Automata for ARC-AGI
by: Xu, Kevin, et al.
Published: (2025)
by: Xu, Kevin, et al.
Published: (2025)
Orthogonalized Policy Optimization:Policy Optimization as Orthogonal Projection in Hilbert Space
by: Zixian, Wang
Published: (2026)
by: Zixian, Wang
Published: (2026)
Wasserstein Policy Optimization
by: Pfau, David, et al.
Published: (2025)
by: Pfau, David, et al.
Published: (2025)
Reflective Policy Optimization
by: Gan, Yaozhong, et al.
Published: (2024)
by: Gan, Yaozhong, et al.
Published: (2024)
Conformal Constrained Policy Optimization for Cost-Effective LLM Agents
by: Si, Wenwen, et al.
Published: (2025)
by: Si, Wenwen, et al.
Published: (2025)
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning
by: Qiu, Xin, et al.
Published: (2025)
by: Qiu, Xin, et al.
Published: (2025)
Group Orthogonalized Policy Optimization:Group Policy Optimization as Orthogonal Projection in Hilbert Space
by: Zixian, Wang
Published: (2026)
by: Zixian, Wang
Published: (2026)
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
by: Qi, Penghui, et al.
Published: (2025)
by: Qi, Penghui, et al.
Published: (2025)
Score-Based One-step MeanFlow Policy Optimization
by: Kim, Kyungyoon, et al.
Published: (2026)
by: Kim, Kyungyoon, et al.
Published: (2026)
Attractor Geometry of Transformer Memory: From Conflict Arbitration to Confident Hallucination
by: Liang, Qiyao, et al.
Published: (2026)
by: Liang, Qiyao, et al.
Published: (2026)
KANITE: Kolmogorov-Arnold Networks for ITE estimation
by: Mehendale, Eshan, et al.
Published: (2025)
by: Mehendale, Eshan, et al.
Published: (2025)
Soft Sequence Policy Optimization
by: Glazyrina, Svetlana, et al.
Published: (2026)
by: Glazyrina, Svetlana, et al.
Published: (2026)
Reparameterization Flow Policy Optimization
by: Zhong, Hai, et al.
Published: (2026)
by: Zhong, Hai, et al.
Published: (2026)
Single-stream Policy Optimization
by: Xu, Zhongwen, et al.
Published: (2025)
by: Xu, Zhongwen, et al.
Published: (2025)
Reparameterization Proximal Policy Optimization
by: Zhong, Hai, et al.
Published: (2025)
by: Zhong, Hai, et al.
Published: (2025)
Variational Delayed Policy Optimization
by: Wu, Qingyuan, et al.
Published: (2024)
by: Wu, Qingyuan, et al.
Published: (2024)
Divergence-Augmented Policy Optimization
by: Wang, Qing, et al.
Published: (2025)
by: Wang, Qing, et al.
Published: (2025)
Decision Flow Policy Optimization
by: Hu, Jifeng, et al.
Published: (2025)
by: Hu, Jifeng, et al.
Published: (2025)
Fractal Landscapes in Policy Optimization
by: Wang, Tao, et al.
Published: (2023)
by: Wang, Tao, et al.
Published: (2023)
ResNets Are Deeper Than You Think
by: Mehmeti-Göpel, Christian H. X. Ali, et al.
Published: (2025)
by: Mehmeti-Göpel, Christian H. X. Ali, et al.
Published: (2025)
Similar Items
-
Evolutionary Policy Optimization
by: Mustafaoglu, Zelal Su "Lain", et al.
Published: (2025) -
Towards Efficient and Expressive Offline RL via Flow-Anchored Noise-conditioned Q-Learning
by: Lee, Sungyoung, et al.
Published: (2026) -
Optimizing the Design of an Artificial Pancreas to Improve Diabetes Management
by: Khanna, Ashok, et al.
Published: (2024) -
Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost
by: Xu, Yinggan, et al.
Published: (2026) -
Flashlight: PyTorch Compiler Extensions to Accelerate Attention Variants
by: You, Bozhi, et al.
Published: (2025)