Saved in:
| Main Authors: | Pendyala, Abhijeet, Atamna, Asma, Glasmachers, Tobias |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.02577 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Curriculum RL meets Monte Carlo Planning: Optimization of a Real World Container Management Problem
by: Pendyala, Abhijeet, et al.
Published: (2025)
by: Pendyala, Abhijeet, et al.
Published: (2025)
Leveraging Genetic Algorithms for Efficient Demonstration Generation in Real-World Reinforcement Learning Environments
by: Maus, Tom, et al.
Published: (2025)
by: Maus, Tom, et al.
Published: (2025)
Balancing Specialization and Centralization: A Multi-Agent Reinforcement Learning Benchmark for Sequential Industrial Control
by: Maus, Tom, et al.
Published: (2025)
by: Maus, Tom, et al.
Published: (2025)
Cumulative Learning Rate Adaptation: Revisiting Path-Based Schedules for SGD and Adam
by: Atamna, Asma, et al.
Published: (2025)
by: Atamna, Asma, et al.
Published: (2025)
Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards
by: Ahmad, Ahmad, et al.
Published: (2024)
by: Ahmad, Ahmad, et al.
Published: (2024)
Deep Reinforcement Learning Based Navigation with Macro Actions and Topological Maps
by: Hakenes, Simon, et al.
Published: (2025)
by: Hakenes, Simon, et al.
Published: (2025)
On-Policy Optimization of ANFIS Policies Using Proximal Policy Optimization
by: Shankar, Kaaustaaub, et al.
Published: (2025)
by: Shankar, Kaaustaaub, et al.
Published: (2025)
Evolutionary Warm-Starts for Reinforcement Learning in Industrial Continuous Control
by: Maus, Tom, et al.
Published: (2026)
by: Maus, Tom, et al.
Published: (2026)
Pulsar Detection with Deep Learning
by: Pendyala, Manideep
Published: (2025)
by: Pendyala, Manideep
Published: (2025)
Trust Region Reward Optimization and Proximal Inverse Reward Optimization Algorithm
by: Chen, Yang, et al.
Published: (2025)
by: Chen, Yang, et al.
Published: (2025)
SortingEnv: An Extendable RL-Environment for an Industrial Sorting Process
by: Maus, Tom, et al.
Published: (2025)
by: Maus, Tom, et al.
Published: (2025)
Reparameterization Proximal Policy Optimization
by: Zhong, Hai, et al.
Published: (2025)
by: Zhong, Hai, et al.
Published: (2025)
Central Path Proximal Policy Optimization
by: Milosevic, Nikola, et al.
Published: (2025)
by: Milosevic, Nikola, et al.
Published: (2025)
Learning Branching Policies for MILPs with Proximal Policy Optimization
by: Mhamed, Abdelouahed Ben, et al.
Published: (2025)
by: Mhamed, Abdelouahed Ben, et al.
Published: (2025)
Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization
by: Kapoor, Aditya, et al.
Published: (2024)
by: Kapoor, Aditya, et al.
Published: (2024)
Pass@K Policy Optimization: Solving Harder Reinforcement Learning Problems
by: Walder, Christian, et al.
Published: (2025)
by: Walder, Christian, et al.
Published: (2025)
Diffusion Policy through Conditional Proximal Policy Optimization
by: Liu, Ben, et al.
Published: (2026)
by: Liu, Ben, et al.
Published: (2026)
Beyond the Boundaries of Proximal Policy Optimization
by: Tan, Charlie B., et al.
Published: (2024)
by: Tan, Charlie B., et al.
Published: (2024)
Proximal Policy Optimization with Adaptive Exploration
by: Lixandru, Andrei
Published: (2024)
by: Lixandru, Andrei
Published: (2024)
Complexity-Regularized Proximal Policy Optimization
by: Serfilippi, Luca, et al.
Published: (2025)
by: Serfilippi, Luca, et al.
Published: (2025)
Transductive Off-policy Proximal Policy Optimization
by: Gan, Yaozhong, et al.
Published: (2024)
by: Gan, Yaozhong, et al.
Published: (2024)
Deep Gaussian Process Proximal Policy Optimization
by: van der Lende, Matthijs, et al.
Published: (2025)
by: van der Lende, Matthijs, et al.
Published: (2025)
Actor-Critic Pretraining for Proximal Policy Optimization
by: Kernbach, Andreas, et al.
Published: (2026)
by: Kernbach, Andreas, et al.
Published: (2026)
OID-PPO: Optimal Interior Design using Proximal Policy Optimization by Transforming Design Guidelines into Reward Functions
by: Yoon, Chanyoung, et al.
Published: (2025)
by: Yoon, Chanyoung, et al.
Published: (2025)
Learning To Solve Differential Equation Constrained Optimization Problems
by: Di Vito, Vincenzo, et al.
Published: (2024)
by: Di Vito, Vincenzo, et al.
Published: (2024)
Eval-PPO: Building an Efficient Threat Evaluator Using Proximal Policy Optimization
by: Sun, Wuzhou, et al.
Published: (2025)
by: Sun, Wuzhou, et al.
Published: (2025)
Fairness Aware Reinforcement Learning via Proximal Policy Optimization
by: La Malfa, Gabriele, et al.
Published: (2025)
by: La Malfa, Gabriele, et al.
Published: (2025)
Combined Peak Reduction and Self-Consumption Using Proximal Policy Optimization
by: Peirelinck, Thijs, et al.
Published: (2022)
by: Peirelinck, Thijs, et al.
Published: (2022)
Hindsight Experience Replay Accelerates Proximal Policy Optimization
by: Crowder, Douglas C., et al.
Published: (2024)
by: Crowder, Douglas C., et al.
Published: (2024)
Token-level Proximal Policy Optimization for Query Generation
by: Ouyang, Yichen, et al.
Published: (2024)
by: Ouyang, Yichen, et al.
Published: (2024)
Match or Replay: Self Imitating Proximal Policy Optimization
by: Chaudhary, Gaurav, et al.
Published: (2026)
by: Chaudhary, Gaurav, et al.
Published: (2026)
KIPPO: Koopman-Inspired Proximal Policy Optimization
by: Cozma, Andrei, et al.
Published: (2025)
by: Cozma, Andrei, et al.
Published: (2025)
ESPO: Early-Stopping Proximal Policy Optimization
by: Li, Zihang, et al.
Published: (2026)
by: Li, Zihang, et al.
Published: (2026)
Intrinsic Reward Policy Optimization for Sparse-Reward Environments
by: Cho, Minjae, et al.
Published: (2026)
by: Cho, Minjae, et al.
Published: (2026)
Variable Metric Evolution Strategies for High-dimensional Multi-Objective Optimization
by: Glasmachers, Tobias
Published: (2024)
by: Glasmachers, Tobias
Published: (2024)
Solving Boltzmann Optimization Problems with Deep Learning
by: Knoll, Fiona, et al.
Published: (2024)
by: Knoll, Fiona, et al.
Published: (2024)
AceGRPO: Adaptive Curriculum Enhanced Group Relative Policy Optimization for Autonomous Machine Learning Engineering
by: Cai, Yuzhu, et al.
Published: (2026)
by: Cai, Yuzhu, et al.
Published: (2026)
ERPPO: Entropy Regularization-based Proximal Policy Optimization
by: Lee, Changha, et al.
Published: (2026)
by: Lee, Changha, et al.
Published: (2026)
Proximal Ranking Policy Optimization for Practical Safety in Counterfactual Learning to Rank
by: Gupta, Shashank, et al.
Published: (2024)
by: Gupta, Shashank, et al.
Published: (2024)
Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization
by: Küçükoğlu, Burcu, et al.
Published: (2022)
by: Küçükoğlu, Burcu, et al.
Published: (2022)
Similar Items
-
Curriculum RL meets Monte Carlo Planning: Optimization of a Real World Container Management Problem
by: Pendyala, Abhijeet, et al.
Published: (2025) -
Leveraging Genetic Algorithms for Efficient Demonstration Generation in Real-World Reinforcement Learning Environments
by: Maus, Tom, et al.
Published: (2025) -
Balancing Specialization and Centralization: A Multi-Agent Reinforcement Learning Benchmark for Sequential Industrial Control
by: Maus, Tom, et al.
Published: (2025) -
Cumulative Learning Rate Adaptation: Revisiting Path-Based Schedules for SGD and Adam
by: Atamna, Asma, et al.
Published: (2025) -
Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards
by: Ahmad, Ahmad, et al.
Published: (2024)