Saved in:
| Main Authors: | Dolatyabi, Parya, Bavil, Ali Farajzadeh, Khodayar, Mahdi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.14730 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Proximal Policy Optimization with Graph Neural Networks for Optimal Power Flow
by: López-Cardona, Ángela, et al.
Published: (2022)
by: López-Cardona, Ángela, et al.
Published: (2022)
Truncated Proximal Policy Optimization
by: Fan, Tiantian, et al.
Published: (2025)
by: Fan, Tiantian, et al.
Published: (2025)
Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization
by: Kapoor, Aditya, et al.
Published: (2024)
by: Kapoor, Aditya, et al.
Published: (2024)
Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems
by: Sgambati, Matthew, et al.
Published: (2025)
by: Sgambati, Matthew, et al.
Published: (2025)
On-Policy Optimization of ANFIS Policies Using Proximal Policy Optimization
by: Shankar, Kaaustaaub, et al.
Published: (2025)
by: Shankar, Kaaustaaub, et al.
Published: (2025)
Reparameterization Proximal Policy Optimization
by: Zhong, Hai, et al.
Published: (2025)
by: Zhong, Hai, et al.
Published: (2025)
Complexity-Regularized Proximal Policy Optimization
by: Serfilippi, Luca, et al.
Published: (2025)
by: Serfilippi, Luca, et al.
Published: (2025)
Beyond the Boundaries of Proximal Policy Optimization
by: Tan, Charlie B., et al.
Published: (2024)
by: Tan, Charlie B., et al.
Published: (2024)
Proximal Policy Optimization with Adaptive Exploration
by: Lixandru, Andrei
Published: (2024)
by: Lixandru, Andrei
Published: (2024)
Metric-Gradient Projection for Stable Multi-Agent Policy Learning
by: Zhang, Zuyuan, et al.
Published: (2026)
by: Zhang, Zuyuan, et al.
Published: (2026)
Sample-efficient Neuro-symbolic Proximal Policy Optimization
by: Murari, Simone, et al.
Published: (2026)
by: Murari, Simone, et al.
Published: (2026)
FedCritic: Serverless Federated Critic Learning-based Resource Allocation for Multi-Cell OFDMA in 6G
by: Farajzadeh, Amin, et al.
Published: (2026)
by: Farajzadeh, Amin, et al.
Published: (2026)
KIPPO: Koopman-Inspired Proximal Policy Optimization
by: Cozma, Andrei, et al.
Published: (2025)
by: Cozma, Andrei, et al.
Published: (2025)
ESPO: Early-Stopping Proximal Policy Optimization
by: Li, Zihang, et al.
Published: (2026)
by: Li, Zihang, et al.
Published: (2026)
Learning Branching Policies for MILPs with Proximal Policy Optimization
by: Mhamed, Abdelouahed Ben, et al.
Published: (2025)
by: Mhamed, Abdelouahed Ben, et al.
Published: (2025)
Proximity-Based Multi-Turn Optimization: Practical Credit Assignment for LLM Agent Training
by: Fang, Yangyi, et al.
Published: (2026)
by: Fang, Yangyi, et al.
Published: (2026)
A Deep Reinforcement Learning Approach to Battery Management in Dairy Farming via Proximal Policy Optimization
by: Ali, Nawazish, et al.
Published: (2024)
by: Ali, Nawazish, et al.
Published: (2024)
Multi-Agent Guided Policy Optimization
by: Li, Yueheng, et al.
Published: (2025)
by: Li, Yueheng, et al.
Published: (2025)
Intelligent Collaborative Optimization for Rubber Tyre Film Production Based on Multi-path Differentiated Clipping Proximal Policy Optimization
by: Ruan, Yinghao, et al.
Published: (2025)
by: Ruan, Yinghao, et al.
Published: (2025)
HMACE: Heterogeneous Multi-Agent Collaborative Evolution for Combinatorial Optimization
by: Yan, Yuping, et al.
Published: (2026)
by: Yan, Yuping, et al.
Published: (2026)
Counterfactual Credit Policy Optimization for Multi-Agent Collaboration
by: Li, Zhongyi, et al.
Published: (2026)
by: Li, Zhongyi, et al.
Published: (2026)
MASPRM: Multi-Agent System Process Reward Model
by: Yazdani, Milad, et al.
Published: (2025)
by: Yazdani, Milad, et al.
Published: (2025)
Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization
by: Fan, Yijia, et al.
Published: (2025)
by: Fan, Yijia, et al.
Published: (2025)
Proximal Policy Distillation
by: Spigler, Giacomo
Published: (2024)
by: Spigler, Giacomo
Published: (2024)
Multi-Agent Reinforcement Learning for Heterogeneous Satellite Cluster Resources Optimization
by: Hady, Mohamad A., et al.
Published: (2025)
by: Hady, Mohamad A., et al.
Published: (2025)
HALO: Learning Human-Robot Collaboration via Heterogeneous-Agent Lyapunov Policy Optimization
by: Zhang, Hao, et al.
Published: (2026)
by: Zhang, Hao, et al.
Published: (2026)
Atomic Proximal Policy Optimization for Electric Robo-Taxi Dispatch and Charger Allocation
by: Dai, Jim, et al.
Published: (2025)
by: Dai, Jim, et al.
Published: (2025)
Proximal Policy Optimization with Evolutionary Mutations
by: Czworkowski, Casimir, et al.
Published: (2026)
by: Czworkowski, Casimir, et al.
Published: (2026)
Multi-Agent Image Restoration
by: Jiang, Xu, et al.
Published: (2025)
by: Jiang, Xu, et al.
Published: (2025)
Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning
by: Liu, Jiashun, et al.
Published: (2025)
by: Liu, Jiashun, et al.
Published: (2025)
CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric
by: Guo, Yunxiao, et al.
Published: (2021)
by: Guo, Yunxiao, et al.
Published: (2021)
A dynamical clipping approach with task feedback for Proximal Policy Optimization
by: Zhang, Ziqi, et al.
Published: (2023)
by: Zhang, Ziqi, et al.
Published: (2023)
Efficient Deep Reinforcement Learning with Predictive Processing Proximal Policy Optimization
by: Küçükoğlu, Burcu, et al.
Published: (2022)
by: Küçükoğlu, Burcu, et al.
Published: (2022)
Iterative Critique-and-Routing Controller for Multi-Agent Systems with Heterogeneous LLMs
by: Fang, Wenzhi, et al.
Published: (2026)
by: Fang, Wenzhi, et al.
Published: (2026)
Offline Safe Policy Optimization From Heterogeneous Feedback
by: Gong, Ze, et al.
Published: (2025)
by: Gong, Ze, et al.
Published: (2025)
Intrinsic Memory Agents: Heterogeneous Multi-Agent LLM Systems through Structured Contextual Memory
by: Yuen, Sizhe, et al.
Published: (2025)
by: Yuen, Sizhe, et al.
Published: (2025)
Imitation Learning via Focused Satisficing
by: Shah, Rushit N., et al.
Published: (2025)
by: Shah, Rushit N., et al.
Published: (2025)
Federated Learning in NTNs: Design, Architecture and Challenges
by: Farajzadeh, Amin, et al.
Published: (2025)
by: Farajzadeh, Amin, et al.
Published: (2025)
Optimizing UAV Aerial Base Station Flights Using DRL-based Proximal Policy Optimization
by: Ibanez, Mario Rico, et al.
Published: (2025)
by: Ibanez, Mario Rico, et al.
Published: (2025)
ExO-PPO: an Extended Off-policy Proximal Policy Optimization Algorithm
by: Wang, Hanyong, et al.
Published: (2026)
by: Wang, Hanyong, et al.
Published: (2026)
Similar Items
-
Proximal Policy Optimization with Graph Neural Networks for Optimal Power Flow
by: López-Cardona, Ángela, et al.
Published: (2022) -
Truncated Proximal Policy Optimization
by: Fan, Tiantian, et al.
Published: (2025) -
Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization
by: Kapoor, Aditya, et al.
Published: (2024) -
Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems
by: Sgambati, Matthew, et al.
Published: (2025) -
On-Policy Optimization of ANFIS Policies Using Proximal Policy Optimization
by: Shankar, Kaaustaaub, et al.
Published: (2025)