Saved in:
| Main Authors: | Dong, Yanjie, Zhang, Haijun, Wang, Gang, Cui, Shisheng, Hu, Xiping |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.06945 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation
by: Wang, Yudan, et al.
Published: (2024)
by: Wang, Yudan, et al.
Published: (2024)
Communication-Efficient Heterogeneous Federated Learning with Generalized Heavy-Ball Momentum
by: Zaccone, Riccardo, et al.
Published: (2023)
by: Zaccone, Riccardo, et al.
Published: (2023)
Diffusion Actor-Critic with Entropy Regulator
by: Wang, Yinuo, et al.
Published: (2024)
by: Wang, Yinuo, et al.
Published: (2024)
Functional Critics Are Essential for Actor-Critic: From Off-Policy Stability to Efficient Exploration
by: Bai, Qinxun, et al.
Published: (2025)
by: Bai, Qinxun, et al.
Published: (2025)
Provable Acceleration of Nesterov's Accelerated Gradient Method over Heavy Ball Method in Training Over-Parameterized Neural Networks
by: Liu, Xin, et al.
Published: (2022)
by: Liu, Xin, et al.
Published: (2022)
Finite-time Convergence Analysis of Actor-Critic with Evolving Reward
by: Hu, Rui, et al.
Published: (2025)
by: Hu, Rui, et al.
Published: (2025)
Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025)
by: Liu, Tong, et al.
Published: (2025)
Value Improved Actor Critic Algorithms
by: Oren, Yaniv, et al.
Published: (2024)
by: Oren, Yaniv, et al.
Published: (2024)
Revisiting Discrete Soft Actor-Critic
by: Zhou, Haibin, et al.
Published: (2022)
by: Zhou, Haibin, et al.
Published: (2022)
Average-Reward Soft Actor-Critic
by: Adamczyk, Jacob, et al.
Published: (2025)
by: Adamczyk, Jacob, et al.
Published: (2025)
Proximal Action Replacement for Behavior Cloning Actor-Critic in Offline Reinforcement Learning
by: Dong, Jinzong, et al.
Published: (2026)
by: Dong, Jinzong, et al.
Published: (2026)
DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty
by: Cui, Mingxuan, et al.
Published: (2025)
by: Cui, Mingxuan, et al.
Published: (2025)
Relative Importance Sampling for off-Policy Actor-Critic in Deep Reinforcement Learning
by: Humayoo, Mahammad, et al.
Published: (2018)
by: Humayoo, Mahammad, et al.
Published: (2018)
Flow Actor-Critic for Offline Reinforcement Learning
by: Chae, Jongseong, et al.
Published: (2026)
by: Chae, Jongseong, et al.
Published: (2026)
Relational Object-Centric Actor-Critic
by: Ugadiarov, Leonid, et al.
Published: (2023)
by: Ugadiarov, Leonid, et al.
Published: (2023)
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
by: Thalagala, Shiron, et al.
Published: (2024)
by: Thalagala, Shiron, et al.
Published: (2024)
Nesterov-Accelerated Robust Federated Learning Over Byzantine Adversaries
by: Xu, Lihan, et al.
Published: (2025)
by: Xu, Lihan, et al.
Published: (2025)
Scalable Neighborhood-Based Multi-Agent Actor-Critic
by: Goppelsroeder, Tim, et al.
Published: (2026)
by: Goppelsroeder, Tim, et al.
Published: (2026)
Revisiting Mixture Policies in Entropy-Regularized Actor-Critic
by: He, Jiamin, et al.
Published: (2026)
by: He, Jiamin, et al.
Published: (2026)
SACn: Soft Actor-Critic with n-step Returns
by: Łyskawa, Jakub, et al.
Published: (2025)
by: Łyskawa, Jakub, et al.
Published: (2025)
Actor-Critics Can Achieve Optimal Sample Efficiency
by: Tan, Kevin, et al.
Published: (2025)
by: Tan, Kevin, et al.
Published: (2025)
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
by: Chen, Yanjun, et al.
Published: (2024)
by: Chen, Yanjun, et al.
Published: (2024)
A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering
by: Qi, Qihan, et al.
Published: (2024)
by: Qi, Qihan, et al.
Published: (2024)
Collaborative Yet Personalized Policy Training: Single-Timescale Federated Actor-Critic
by: Wang, Leo Muxing, et al.
Published: (2026)
by: Wang, Leo Muxing, et al.
Published: (2026)
Dissecting Discrete Soft Actor-Critic: Limitations and Principled Alternatives
by: Asad, Reza, et al.
Published: (2025)
by: Asad, Reza, et al.
Published: (2025)
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
by: Garcin, Samuel, et al.
Published: (2025)
by: Garcin, Samuel, et al.
Published: (2025)
Ask-AC: An Initiative Advisor-in-the-Loop Actor-Critic Framework
by: Liu, Shunyu, et al.
Published: (2022)
by: Liu, Shunyu, et al.
Published: (2022)
Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning
by: Küçükoğlu, Burcu, et al.
Published: (2025)
by: Küçükoğlu, Burcu, et al.
Published: (2025)
Stabilizing the Q-Gradient Field for Policy Smoothness in Actor-Critic
by: Lee, Jeong Woon, et al.
Published: (2026)
by: Lee, Jeong Woon, et al.
Published: (2026)
(Accelerated) Noise-adaptive Stochastic Heavy-Ball Momentum
by: Dang, Anh, et al.
Published: (2024)
by: Dang, Anh, et al.
Published: (2024)
Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control
by: Chen, Donghe, et al.
Published: (2025)
by: Chen, Donghe, et al.
Published: (2025)
Offline Actor-Critic Reinforcement Learning Scales to Large Models
by: Springenberg, Jost Tobias, et al.
Published: (2024)
by: Springenberg, Jost Tobias, et al.
Published: (2024)
Quantum Advantage Actor-Critic for Reinforcement Learning
by: Kölle, Michael, et al.
Published: (2024)
by: Kölle, Michael, et al.
Published: (2024)
Goal Recognition using Actor-Critic Optimization
by: Nageris, Ben, et al.
Published: (2024)
by: Nageris, Ben, et al.
Published: (2024)
Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation
by: Choe, Jean Seong Bjorn, et al.
Published: (2024)
by: Choe, Jean Seong Bjorn, et al.
Published: (2024)
Double Actor-Critic with TD Error-Driven Regularization in Reinforcement Learning
by: Chen, Haohui, et al.
Published: (2024)
by: Chen, Haohui, et al.
Published: (2024)
SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer
by: de Lara, Nathan Samuel, et al.
Published: (2026)
by: de Lara, Nathan Samuel, et al.
Published: (2026)
Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs
by: Kohler, Hector, et al.
Published: (2023)
by: Kohler, Hector, et al.
Published: (2023)
Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization
by: Sen, Sayambhu, et al.
Published: (2025)
by: Sen, Sayambhu, et al.
Published: (2025)
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
by: Ma, Xiaoteng, et al.
Published: (2020)
by: Ma, Xiaoteng, et al.
Published: (2020)
Similar Items
-
Non-Asymptotic Analysis for Single-Loop (Natural) Actor-Critic with Compatible Function Approximation
by: Wang, Yudan, et al.
Published: (2024) -
Communication-Efficient Heterogeneous Federated Learning with Generalized Heavy-Ball Momentum
by: Zaccone, Riccardo, et al.
Published: (2023) -
Diffusion Actor-Critic with Entropy Regulator
by: Wang, Yinuo, et al.
Published: (2024) -
Functional Critics Are Essential for Actor-Critic: From Off-Policy Stability to Efficient Exploration
by: Bai, Qinxun, et al.
Published: (2025) -
Provable Acceleration of Nesterov's Accelerated Gradient Method over Heavy Ball Method in Training Over-Parameterized Neural Networks
by: Liu, Xin, et al.
Published: (2022)