| Main Authors: | Khattar, Vanshaj, Jin, Ming |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.15368 |
Similar Items
Offline Reinforcement Learning via Inverse Optimization
by: Dimanidis, Ioannis, et al.
Published: (2025)
Offline Hierarchical Reinforcement Learning via Inverse Optimization
by: Schmidt, Carolin, et al.
Published: (2024)
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning
by: Zhang, Ruoqi, et al.
Published: (2024)
A CMDP-within-online framework for Meta-Safe Reinforcement Learning
by: Khattar, Vanshaj, et al.
Published: (2024)
Parameter Stress Analysis in Reinforcement Learning: Applying Synaptic Filtering to Policy Networks
by: Abdeen, Zain ul, et al.
Published: (2025)
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies
by: Yan, Runze, et al.
Published: (2025)
Operator Models for Continuous-Time Offline Reinforcement Learning
by: Hoischen, Nicolas, et al.
Published: (2025)
Solving Offline Reinforcement Learning with Decision Tree Regression
by: Koirala, Prajwal, et al.
Published: (2024)
Data Center Cooling System Optimization Using Offline Reinforcement Learning
by: Zhan, Xianyuan, et al.
Published: (2025)
Quasi-Newton Compatible Actor-Critic for Deterministic Policies
by: Kordabad, Arash Bahari, et al.
Published: (2025)
Robust Bandwidth Estimation for Real-Time Communication with Offline Reinforcement Learning
by: Kai, Jian, et al.
Published: (2025)
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
by: Long, Kehan, et al.
Published: (2025)
LexiSafe: Offline Safe Reinforcement Learning with Lexicographic Safety-Reward Hierarchy
by: Yang, Hsin-Jung, et al.
Published: (2026)
Deterministic Trajectory Optimization through Probabilistic Optimal Control
by: Filabadi, Mohammad Mahmoudi, et al.
Published: (2024)
Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation
by: Peri, Samuele, et al.
Published: (2024)
Offline Reinforcement-Learning-Based Power Control for Application-Agnostic Energy Efficiency
by: Raj, Akhilesh, et al.
Published: (2026)
Feasible Policy Iteration for Safe Reinforcement Learning
by: Yang, Yujie, et al.
Published: (2023)
Safety Optimized Reinforcement Learning via Multi-Objective Policy Optimization
by: Honari, Homayoun, et al.
Published: (2024)
Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning
by: Yifru, Lunet, et al.
Published: (2024)
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
by: Zhang, Chenyu, et al.
Published: (2024)
HOFLON: Hybrid Offline Learning and Online Optimization for Process Start-Up and Grade-Transition Control
by: Durkin, Alex, et al.
Published: (2025)
On Robust Reinforcement Learning with Lipschitz-Bounded Policy Networks
by: Barbara, Nicholas H., et al.
Published: (2024)
Nonuniqueness and Convergence to Equivalent Solutions in Observer-based Inverse Reinforcement Learning
by: Town, Jared, et al.
Published: (2022)
Proximal Reliability Optimization for Reinforcement Learning
by: Patwardhan, Narendra, et al.
Published: (2019)
Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm
by: Bai, Qinbo, et al.
Published: (2021)
Safe Deployment of Offline Reinforcement Learning via Input Convex Action Correction
by: Durkin, Alex, et al.
Published: (2025)
Off Policy Lyapunov Stability in Reinforcement Learning
by: Gill, Sarvan, et al.
Published: (2025)
Reinforced Model Predictive Control via Trust-Region Quasi-Newton Policy Optimization
by: Brandner, Dean, et al.
Published: (2024)
Online Residual Learning from Offline Experts for Pedestrian Tracking
by: Vlachos, Anastasios, et al.
Published: (2024)
Solving Reach-Avoid-Stay Problems Using Deep Deterministic Policy Gradients
by: Chenevert, Gabriel, et al.
Published: (2024)
Stable Inverse Reinforcement Learning: Policies from Control Lyapunov Landscapes
by: Tesfazgi, Samuel, et al.
Published: (2024)
Predictive Lagrangian Optimization for Constrained Reinforcement Learning
by: Zhang, Tianqi, et al.
Published: (2025)
Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis
by: Suh, Jihoon, et al.
Published: (2025)
Power Allocation for Delay Optimization in Device-to-Device Networks: A Graph Reinforcement Learning Approach
by: Fang, Hao, et al.
Published: (2025)
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
by: Lopez, Antonio, et al.
Published: (2024)
Settling the Sample Complexity of Model-Based Offline Reinforcement Learning
by: Li, Gen, et al.
Published: (2022)
MAD: A Magnitude And Direction Policy Parametrization for Stability Constrained Reinforcement Learning
by: Furieri, Luca, et al.
Published: (2025)
Structured Reinforcement Learning for Incentivized Stochastic Covert Optimization
by: Jain, Adit, et al.
Published: (2024)
Game-Theory-Assisted Reinforcement Learning for Border Defense: Early Termination based on Analytical Solutions
by: Das, Goutam, et al.
Published: (2026)
Investigating the Impact of Observation Space Design Choices On Training Reinforcement Learning Solutions for Spacecraft Problems
by: Hamilton, Nathaniel, et al.
Published: (2025)