:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Hengrui, Lin, Youfang, Han, Sheng, Wang, Shuo, Lv, Kai
Format:	Preprint
Published:	2022
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2201.07286
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning
by: Yu, Xiaoyang, et al.
Published: (2023)

Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space
by: Yu, Xiaoyang, et al.
Published: (2024)

Probabilistic Constraint for Safety-Critical Reinforcement Learning
by: Chen, Weiqin, et al.
Published: (2023)

Time-Series Contrastive Learning against False Negatives and Class Imbalance
by: Jin, Xiyuan, et al.
Published: (2023)

Safe Reinforcement Learning with Learned Non-Markovian Safety Constraints
by: Low, Siow Meng, et al.
Published: (2024)

A Diffusion-Based Method for Learning the Multi-Outcome Distribution of Medical Treatments
by: Ma, Yuchen, et al.
Published: (2025)

Handling Distribution Shifts on Graphs: An Invariance Perspective
by: Wu, Qitian, et al.
Published: (2022)

Reinforcement Learning via Conservative Agent for Environments with Random Delays
by: Lee, Jongsoo, et al.
Published: (2025)

Diffusion Models for Offline Multi-agent Reinforcement Learning with Safety Constraints
by: Huang, Jianuo
Published: (2024)

AlphaAlign: Incentivizing Safety Alignment with Extremely Simplified Reinforcement Learning
by: Zhang, Yi, et al.
Published: (2025)

CombAlign: Enhancing Model Expressiveness in Unsupervised Graph Alignment
by: Chen, Songyang, et al.
Published: (2024)

Policy Constraint by Only Support Constraint for Offline Reinforcement Learning
by: Gao, Yunkai, et al.
Published: (2025)

Automatic Constraint Policy Optimization based on Continuous Constraint Interpolation Framework for Offline Reinforcement Learning
by: Han, Xinchen, et al.
Published: (2026)

PIQL: Projective Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning
by: Han, Xinchen, et al.
Published: (2025)

Guardian: Decoupling Exploration from Safety in Reinforcement Learning
by: Cai, Kaitong, et al.
Published: (2025)

Mildly Conservative Q-Learning for Offline Reinforcement Learning
by: Lyu, Jiafei, et al.
Published: (2022)

SGFormer: Single-Layer Graph Transformers with Approximation-Free Linear Complexity
by: Wu, Qitian, et al.
Published: (2024)

Safety Modulation: Enhancing Safety in Reinforcement Learning through Cost-Modulated Rewards
by: Zhang, Hanping, et al.
Published: (2025)

Learning Safety Constraints for Large Language Models
by: Chen, Xin, et al.
Published: (2025)

Safe Offline Reinforcement Learning with Real-Time Budget Constraints
by: Lin, Qian, et al.
Published: (2023)

Towards Effective and Efficient Graph Alignment without Supervision
by: Chen, Songyang, et al.
Published: (2026)

Reinforcement Learning from Human Feedback with High-Confidence Safety Constraints
by: Chittepu, Yaswanth, et al.
Published: (2025)

Spatial-Temporal Cross-View Contrastive Pre-training for Check-in Sequence Representation Learning
by: Gong, Letian, et al.
Published: (2024)

Mildly Conservative Regularized Evaluation for Offline Reinforcement Learning
by: Chen, Haohui, et al.
Published: (2025)

R^3: Replay, Reflection, and Ranking Rewards for LLM Reinforcement Learning
by: Jiang, Zhizheng, et al.
Published: (2026)

Inverse Reinforcement Learning With Constraint Recovery
by: Das, Nirjhar, et al.
Published: (2023)

Kernel-Based Distributed Q-Learning: A Scalable Reinforcement Learning Approach for Dynamic Treatment Regimes
by: Wang, Di, et al.
Published: (2023)

Learning Safety Constraints from Demonstrations with Unknown Rewards
by: Lindner, David, et al.
Published: (2023)

Multi-Path Collaborative Reasoning via Reinforcement Learning
by: Lv, Jindi, et al.
Published: (2025)

Networked Restless Multi-Arm Bandits with Reinforcement Learning
by: Zhang, Hanmo, et al.
Published: (2025)

Improving monotonic optimization in heterogeneous multi-agent reinforcement learning with optimal marginal deterministic policy gradient
by: Yu, Xiaoyang, et al.
Published: (2025)

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation
by: Hou, Hongru, et al.
Published: (2026)

Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
by: Gu, Shangding, et al.
Published: (2024)

Ferret: An Efficient Online Continual Learning Framework under Varying Memory Constraints
by: Zhou, Yuhao, et al.
Published: (2025)

Learning Local Constraints for Reinforcement-Learned Content Generators
by: Bhaumik, Debosmita, et al.
Published: (2026)

Reinforcement Learning with $ω$-Regular Objectives and Constraints
by: Wagner, Dominik, et al.
Published: (2025)

Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates
by: Li, Yibo, et al.
Published: (2026)

Constraint-Conditioned Policy Optimization for Versatile Safe Reinforcement Learning
by: Yao, Yihang, et al.
Published: (2023)

Adaptive and Robust DBSCAN with Multi-agent Reinforcement Learning
by: Peng, Hao, et al.
Published: (2025)

Augmented Lagrangian Multiplier Network for State-wise Safety in Reinforcement Learning
by: Zhang, Jiaming, et al.
Published: (2026)