Saved in:
| Main Authors: | Ma, Xiaoteng, Chen, Junyao, Xia, Li, Yang, Jun, Zhao, Qianchuan, Zhou, Zhengyuan |
|---|---|
| Format: | Preprint |
| Published: |
2020
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2004.14547 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Single-Trajectory Distributionally Robust Reinforcement Learning
by: Liang, Zhipeng, et al.
Published: (2023)
by: Liang, Zhipeng, et al.
Published: (2023)
DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty
by: Cui, Mingxuan, et al.
Published: (2025)
by: Cui, Mingxuan, et al.
Published: (2025)
Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic
by: Vo, Thanh Vinh, et al.
Published: (2025)
by: Vo, Thanh Vinh, et al.
Published: (2025)
Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025)
by: Liu, Tong, et al.
Published: (2025)
Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning
by: Küçükoğlu, Burcu, et al.
Published: (2025)
by: Küçükoğlu, Burcu, et al.
Published: (2025)
Functional Critics Are Essential for Actor-Critic: From Off-Policy Stability to Efficient Exploration
by: Bai, Qinxun, et al.
Published: (2025)
by: Bai, Qinxun, et al.
Published: (2025)
Revisiting Discrete Soft Actor-Critic
by: Zhou, Haibin, et al.
Published: (2022)
by: Zhou, Haibin, et al.
Published: (2022)
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
by: Neo, Dexter, et al.
Published: (2023)
by: Neo, Dexter, et al.
Published: (2023)
Average-Reward Soft Actor-Critic
by: Adamczyk, Jacob, et al.
Published: (2025)
by: Adamczyk, Jacob, et al.
Published: (2025)
Bidirectional Soft Actor-Critic: Leveraging Forward and Reverse KL Divergence for Efficient Reinforcement Learning
by: Zhang, Yixian, et al.
Published: (2025)
by: Zhang, Yixian, et al.
Published: (2025)
Reinforcement Learning Position Control of a Quadrotor Using Soft Actor-Critic (SAC)
by: Mahran, Youssef, et al.
Published: (2025)
by: Mahran, Youssef, et al.
Published: (2025)
Flow Actor-Critic for Offline Reinforcement Learning
by: Chae, Jongseong, et al.
Published: (2026)
by: Chae, Jongseong, et al.
Published: (2026)
FastDSAC: Unlocking the Potential of Maximum Entropy RL in High-Dimensional Humanoid Control
by: Xue, Jun, et al.
Published: (2026)
by: Xue, Jun, et al.
Published: (2026)
Mildly Conservative Q-Learning for Offline Reinforcement Learning
by: Lyu, Jiafei, et al.
Published: (2022)
by: Lyu, Jiafei, et al.
Published: (2022)
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
by: Chen, Yanjun, et al.
Published: (2024)
by: Chen, Yanjun, et al.
Published: (2024)
Risk-Sensitive RL for Alleviating Exploration Dilemmas in Large Language Models
by: Jiang, Yuhua, et al.
Published: (2025)
by: Jiang, Yuhua, et al.
Published: (2025)
Double Actor-Critic with TD Error-Driven Regularization in Reinforcement Learning
by: Chen, Haohui, et al.
Published: (2024)
by: Chen, Haohui, et al.
Published: (2024)
Episodic Novelty Through Temporal Distance
by: Jiang, Yuhua, et al.
Published: (2025)
by: Jiang, Yuhua, et al.
Published: (2025)
Efficient Multi-agent Reinforcement Learning by Planning
by: Liu, Qihan, et al.
Published: (2024)
by: Liu, Qihan, et al.
Published: (2024)
SACn: Soft Actor-Critic with n-step Returns
by: Łyskawa, Jakub, et al.
Published: (2025)
by: Łyskawa, Jakub, et al.
Published: (2025)
Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning
by: Yang, Tong, et al.
Published: (2023)
by: Yang, Tong, et al.
Published: (2023)
Efficient $Q$-Learning and Actor-Critic Methods for Robust Average Reward Reinforcement Learning
by: Xu, Yang, et al.
Published: (2025)
by: Xu, Yang, et al.
Published: (2025)
Quantum Advantage Actor-Critic for Reinforcement Learning
by: Kölle, Michael, et al.
Published: (2024)
by: Kölle, Michael, et al.
Published: (2024)
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
by: Thalagala, Shiron, et al.
Published: (2024)
by: Thalagala, Shiron, et al.
Published: (2024)
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
by: Garcin, Samuel, et al.
Published: (2025)
by: Garcin, Samuel, et al.
Published: (2025)
Simulation-Based Benchmarking of Reinforcement Learning Agents for Personalized Retail Promotions
by: Xia, Yu, et al.
Published: (2024)
by: Xia, Yu, et al.
Published: (2024)
Dissecting Discrete Soft Actor-Critic: Limitations and Principled Alternatives
by: Asad, Reza, et al.
Published: (2025)
by: Asad, Reza, et al.
Published: (2025)
Relative Importance Sampling for off-Policy Actor-Critic in Deep Reinforcement Learning
by: Humayoo, Mahammad, et al.
Published: (2018)
by: Humayoo, Mahammad, et al.
Published: (2018)
Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control
by: Chen, Donghe, et al.
Published: (2025)
by: Chen, Donghe, et al.
Published: (2025)
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering
by: Qi, Qihan, et al.
Published: (2024)
by: Qi, Qihan, et al.
Published: (2024)
Proximal Action Replacement for Behavior Cloning Actor-Critic in Offline Reinforcement Learning
by: Dong, Jinzong, et al.
Published: (2026)
by: Dong, Jinzong, et al.
Published: (2026)
Offline Actor-Critic Reinforcement Learning Scales to Large Models
by: Springenberg, Jost Tobias, et al.
Published: (2024)
by: Springenberg, Jost Tobias, et al.
Published: (2024)
Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios
by: Zhang, Feihong, et al.
Published: (2025)
by: Zhang, Feihong, et al.
Published: (2025)
Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control
by: Alzorgan, Hazim, et al.
Published: (2025)
by: Alzorgan, Hazim, et al.
Published: (2025)
Human-Readable Programs as Actors of Reinforcement Learning Agents Using Critic-Moderated Evolution
by: Deproost, Senne, et al.
Published: (2024)
by: Deproost, Senne, et al.
Published: (2024)
Contraction Actor-Critic: Contraction Metric-Guided Reinforcement Learning for Robust Path Tracking
by: Cho, Minjae, et al.
Published: (2025)
by: Cho, Minjae, et al.
Published: (2025)
AI Olympics challenge with Evolutionary Soft Actor Critic
by: Calì, Marco, et al.
Published: (2024)
by: Calì, Marco, et al.
Published: (2024)
Bridging the Gap: Enabling Soft Actor Critic for High Performance Legged Locomotion
by: Sabatini, Gianluca, et al.
Published: (2026)
by: Sabatini, Gianluca, et al.
Published: (2026)
Risk-Sensitive Soft Actor-Critic for Robust Deep Reinforcement Learning under Distribution Shifts
by: Enders, Tobias, et al.
Published: (2024)
by: Enders, Tobias, et al.
Published: (2024)
${\rm E}(3)$-Equivariant Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning
by: Chen, Dingyang, et al.
Published: (2023)
by: Chen, Dingyang, et al.
Published: (2023)
Similar Items
-
Single-Trajectory Distributionally Robust Reinforcement Learning
by: Liang, Zhipeng, et al.
Published: (2023) -
DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty
by: Cui, Mingxuan, et al.
Published: (2025) -
Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic
by: Vo, Thanh Vinh, et al.
Published: (2025) -
Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025) -
Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning
by: Küçükoğlu, Burcu, et al.
Published: (2025)