Saved in:
| Main Authors: | Peng, Xueqiao, Perrault, Andrew |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.19397 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Normality-Guided Distributional Reinforcement Learning for Continuous Control
by: Byun, Ju-Seung, et al.
Published: (2022)
by: Byun, Ju-Seung, et al.
Published: (2022)
Optimizing Urban Service Allocation with Time-Constrained Restless Bandits
by: Mao, Yi, et al.
Published: (2025)
by: Mao, Yi, et al.
Published: (2025)
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales
by: Byun, Ju-Seung, et al.
Published: (2024)
by: Byun, Ju-Seung, et al.
Published: (2024)
The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards
by: Chen, Xi, et al.
Published: (2024)
by: Chen, Xi, et al.
Published: (2024)
C2-DPO: Constrained Controlled Direct Preference Optimization
by: Asadi, Kavosh, et al.
Published: (2025)
by: Asadi, Kavosh, et al.
Published: (2025)
ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback
by: Byun, Ju-Seung, et al.
Published: (2024)
by: Byun, Ju-Seung, et al.
Published: (2024)
DLPO: Diffusion Model Loss-Guided Reinforcement Learning for Fine-Tuning Text-to-Speech Diffusion Models
by: Chen, Jingyi, et al.
Published: (2024)
by: Chen, Jingyi, et al.
Published: (2024)
Recovering Physical Dynamics from Discrete Observations via Intrinsic Differential Consistency
by: Luo, Yuxiang, et al.
Published: (2026)
by: Luo, Yuxiang, et al.
Published: (2026)
Hierarchical Reinforcement Learning with Targeted Causal Interventions
by: Khorasani, Sadegh, et al.
Published: (2025)
by: Khorasani, Sadegh, et al.
Published: (2025)
Large Scale Constrained Clustering With Reinforcement Learning
by: Schesch, Benedikt, et al.
Published: (2024)
by: Schesch, Benedikt, et al.
Published: (2024)
Leaving the Nest: Going Beyond Local Loss Functions for Predict-Then-Optimize
by: Shah, Sanket, et al.
Published: (2023)
by: Shah, Sanket, et al.
Published: (2023)
Multi-Agent Reinforcement Learning for Adaptive Resource Orchestration in Cloud-Native Clusters
by: Yao, Guanzi, et al.
Published: (2025)
by: Yao, Guanzi, et al.
Published: (2025)
Memory Allocation in Resource-Constrained Reinforcement Learning
by: Tamborski, Massimiliano, et al.
Published: (2025)
by: Tamborski, Massimiliano, et al.
Published: (2025)
SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Developing Economies
by: Jain, Maeghal, et al.
Published: (2024)
by: Jain, Maeghal, et al.
Published: (2024)
Safety Constrained Multi-Agent Reinforcement Learning for Active Voltage Control
by: Qu, Yang, et al.
Published: (2024)
by: Qu, Yang, et al.
Published: (2024)
Decentralized Reinforcement Learning for Multi-Agent Multi-Resource Allocation via Dynamic Cluster Agreements
by: Marino, Antonio, et al.
Published: (2025)
by: Marino, Antonio, et al.
Published: (2025)
Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning
by: Lu, Yuxiao, et al.
Published: (2023)
by: Lu, Yuxiao, et al.
Published: (2023)
Sequential Stochastic Combinatorial Optimization Using Hierarchal Reinforcement Learning
by: Feng, Xinsong, et al.
Published: (2025)
by: Feng, Xinsong, et al.
Published: (2025)
Classical and Deep Reinforcement Learning Inventory Control Policies for Pharmaceutical Supply Chains with Perishability and Non-Stationarity
by: Stranieri, Francesco, et al.
Published: (2025)
by: Stranieri, Francesco, et al.
Published: (2025)
Secure Resource Allocation via Constrained Deep Reinforcement Learning
by: Sun, Jianfei, et al.
Published: (2025)
by: Sun, Jianfei, et al.
Published: (2025)
Sharper Perturbed-Kullback-Leibler Exponential Tail Bounds for Beta and Dirichlet Distributions
by: Perrault, Pierre
Published: (2025)
by: Perrault, Pierre
Published: (2025)
Ready from Day 1: Population-Aware Coordination for Large-Scale Constrained Multi-Agent Systems
by: Wang, Angel, et al.
Published: (2026)
by: Wang, Angel, et al.
Published: (2026)
HCPO: Hierarchical Conductor-Based Policy Optimization in Multi-Agent Reinforcement Learning
by: Liu, Zejiao, et al.
Published: (2025)
by: Liu, Zejiao, et al.
Published: (2025)
Optimizing 2D+1 Packing in Constrained Environments Using Deep Reinforcement Learning
by: Pugliese, Victor Ulisses, et al.
Published: (2025)
by: Pugliese, Victor Ulisses, et al.
Published: (2025)
Goal Reaching with Eikonal-Constrained Hierarchical Quasimetric Reinforcement Learning
by: Giammarino, Vittorio, et al.
Published: (2025)
by: Giammarino, Vittorio, et al.
Published: (2025)
Optimizing Electric Bus Charging Scheduling with Uncertainties Using Hierarchical Deep Reinforcement Learning
by: Qi, Jiaju, et al.
Published: (2025)
by: Qi, Jiaju, et al.
Published: (2025)
Hierarchical Policy-Gradient Reinforcement Learning for Multi-Agent Shepherding Control of Non-Cohesive Targets
by: Covone, Stefano, et al.
Published: (2025)
by: Covone, Stefano, et al.
Published: (2025)
Predictive Lagrangian Optimization for Constrained Reinforcement Learning
by: Zhang, Tianqi, et al.
Published: (2025)
by: Zhang, Tianqi, et al.
Published: (2025)
Constrained Optimization of Charged Particle Tracking with Multi-Agent Reinforcement Learning
by: Kortus, Tobias, et al.
Published: (2025)
by: Kortus, Tobias, et al.
Published: (2025)
Dual-Mandate Patrols: Multi-Armed Bandits for Green Security
by: Xu, Lily, et al.
Published: (2020)
by: Xu, Lily, et al.
Published: (2020)
Cultivating Archipelago of Forests: Evolving Robust Decision Trees through Island Coevolution
by: Żychowski, Adam, et al.
Published: (2024)
by: Żychowski, Adam, et al.
Published: (2024)
MEL: Multi-level Ensemble Learning for Resource-Constrained Environments
by: Gudipaty, Krishna Praneet, et al.
Published: (2025)
by: Gudipaty, Krishna Praneet, et al.
Published: (2025)
Risk-Averse Constrained Reinforcement Learning with Optimized Certainty Equivalents
by: Lee, Jane H., et al.
Published: (2025)
by: Lee, Jane H., et al.
Published: (2025)
SLA-MORL: SLA-Aware Multi-Objective Reinforcement Learning for HPC Resource Optimization
by: Mostafa, Seraj Al Mahmud, et al.
Published: (2025)
by: Mostafa, Seraj Al Mahmud, et al.
Published: (2025)
Controlling Underestimation Bias in Constrained Reinforcement Learning for Safe Exploration
by: Gao, Shiqing, et al.
Published: (2026)
by: Gao, Shiqing, et al.
Published: (2026)
A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control
by: Satheesh, Anirudh, et al.
Published: (2025)
by: Satheesh, Anirudh, et al.
Published: (2025)
Cluster-Specific Predictive Modeling: A Scalable Solution for Resource-Constrained Wi-Fi Controllers
by: Fontanesi, Gianluca, et al.
Published: (2026)
by: Fontanesi, Gianluca, et al.
Published: (2026)
Constrained Multi-Objective Reinforcement Learning with Max-Min Criterion
by: Park, Giseung, et al.
Published: (2026)
by: Park, Giseung, et al.
Published: (2026)
Experience Constrained Hierarchical Federated Reinforcement Learning for Large-scale UAV Teams in Hazardous Environments
by: Huang, Qinwei, et al.
Published: (2026)
by: Huang, Qinwei, et al.
Published: (2026)
Federated Hierarchical Reinforcement Learning for Adaptive Traffic Signal Control
by: Fu, Yongjie, et al.
Published: (2025)
by: Fu, Yongjie, et al.
Published: (2025)
Similar Items
-
Normality-Guided Distributional Reinforcement Learning for Continuous Control
by: Byun, Ju-Seung, et al.
Published: (2022) -
Optimizing Urban Service Allocation with Time-Constrained Restless Bandits
by: Mao, Yi, et al.
Published: (2025) -
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales
by: Byun, Ju-Seung, et al.
Published: (2024) -
The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards
by: Chen, Xi, et al.
Published: (2024) -
C2-DPO: Constrained Controlled Direct Preference Optimization
by: Asadi, Kavosh, et al.
Published: (2025)