Library Catalog

Bibliographic Details
Main Authors:	Ma, Hao; Pu, Zhiqiang; Ai, Xiaolin; Wang, Huimu
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2603.17468

Similar Items

OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning
by: Feng, Jinyuan, et al.
Published: (2025)

Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning
by: Ma, Hao, et al.
Published: (2025)

Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner
by: Ma, Hao, et al.
Published: (2026)

Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
by: Chen, Yanjun, et al.
Published: (2024)

Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
by: Ishfaq, Haque, et al.
Published: (2025)

Safe Langevin Soft Actor Critic
by: Keswani, Mahesh, et al.
Published: (2026)

Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning
by: Küçükoğlu, Burcu, et al.
Published: (2025)

Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025)

Generative Actor-Critic with Soft Bridge Policies
by: He, Ke, et al.
Published: (2026)

Revisiting Discrete Soft Actor-Critic
by: Zhou, Haibin, et al.
Published: (2022)

Average-Reward Soft Actor-Critic
by: Adamczyk, Jacob, et al.
Published: (2025)

Wasserstein Barycenter Soft Actor-Critic
by: Shahrooei, Zahra, et al.
Published: (2025)

PAC-Bayesian Soft Actor-Critic Learning
by: Tasdighi, Bahareh, et al.
Published: (2023)

Distributional Soft Actor-Critic with Three Refinements
by: Duan, Jingliang, et al.
Published: (2023)

Learning Without Time-Based Embodiment Resets in Soft-Actor Critic
by: Farrahi, Homayoon, et al.
Published: (2025)

Broad Critic Deep Actor Reinforcement Learning for Continuous Control
by: Thalagala, Shiron, et al.
Published: (2024)

S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
by: Messaoud, Safa, et al.
Published: (2024)

DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
by: Ma, Xiaoteng, et al.
Published: (2020)

SACn: Soft Actor-Critic with n-step Returns
by: Łyskawa, Jakub, et al.
Published: (2025)

Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic
by: Vo, Thanh Vinh, et al.
Published: (2025)

ISAACS: Iterative Soft Adversarial Actor-Critic for Safety
by: Hsu, Kai-Chieh, et al.
Published: (2022)

Chunking the Critic: A Transformer-based Soft Actor-Critic with N-Step Returns
by: Tian, Dong, et al.
Published: (2025)

SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
by: Gaven, Loris, et al.
Published: (2024)

Actor-Critic without Actor
by: Ki, Donghyeon, et al.
Published: (2025)

Soft Actor-Critic-based Control Barrier Adaptation for Robust Autonomous Navigation in Unknown Environments
by: Mohammad, Nicholas, et al.
Published: (2025)

Reinforcement Learning Position Control of a Quadrotor Using Soft Actor-Critic (SAC)
by: Mahran, Youssef, et al.
Published: (2025)

Dissecting Discrete Soft Actor-Critic: Limitations and Principled Alternatives
by: Asad, Reza, et al.
Published: (2025)

Bidirectional Soft Actor-Critic: Leveraging Forward and Reverse KL Divergence for Efficient Reinforcement Learning
by: Zhang, Yixian, et al.
Published: (2025)

Generative Actor Critic
by: Qin, Aoyang, et al.
Published: (2025)

DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
by: Neo, Dexter, et al.
Published: (2023)

AI Olympics challenge with Evolutionary Soft Actor Critic
by: Calì, Marco, et al.
Published: (2024)

Improving Actor-Critic Training with Steerable Action-Value Approximation Errors
by: Tasdighi, Bahareh, et al.
Published: (2024)

Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control
by: Alzorgan, Hazim, et al.
Published: (2025)

Policy-Based Radiative Transfer: Solving the $2$-Level Atom Non-LTE Problem using Soft Actor-Critic Reinforcement Learning
by: Panos, Brandon, et al.
Published: (2025)

Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control
by: Woywood, Zeno, et al.
Published: (2024)

Actor-Critic Reinforcement Learning with Phased Actor
by: Wu, Ruofan, et al.
Published: (2024)

Actor-Critic or Critic-Actor? A Tale of Two Time Scales
by: Bhatnagar, Shalabh, et al.
Published: (2022)

D2 Actor Critic: Diffusion Actor Meets Distributional Critic
by: Zhang, Lunjun, et al.
Published: (2025)

Actor-Critic Physics-informed Neural Lyapunov Control
by: Wang, Jiarui, et al.
Published: (2024)

Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios
by: Zhang, Feihong, et al.
Published: (2025)