Saved in:
| Main Authors: | Ma, Hao, Pu, Zhiqiang, Ai, Xiaolin, Wang, Huimu |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.17468 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning
by: Feng, Jinyuan, et al.
Published: (2025)
by: Feng, Jinyuan, et al.
Published: (2025)
Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning
by: Ma, Hao, et al.
Published: (2025)
by: Ma, Hao, et al.
Published: (2025)
Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner
by: Ma, Hao, et al.
Published: (2026)
by: Ma, Hao, et al.
Published: (2026)
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
by: Chen, Yanjun, et al.
Published: (2024)
by: Chen, Yanjun, et al.
Published: (2024)
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
by: Ishfaq, Haque, et al.
Published: (2025)
by: Ishfaq, Haque, et al.
Published: (2025)
Safe Langevin Soft Actor Critic
by: Keswani, Mahesh, et al.
Published: (2026)
by: Keswani, Mahesh, et al.
Published: (2026)
Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning
by: Küçükoğlu, Burcu, et al.
Published: (2025)
by: Küçükoğlu, Burcu, et al.
Published: (2025)
Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025)
by: Liu, Tong, et al.
Published: (2025)
Generative Actor-Critic with Soft Bridge Policies
by: He, Ke, et al.
Published: (2026)
by: He, Ke, et al.
Published: (2026)
Revisiting Discrete Soft Actor-Critic
by: Zhou, Haibin, et al.
Published: (2022)
by: Zhou, Haibin, et al.
Published: (2022)
Average-Reward Soft Actor-Critic
by: Adamczyk, Jacob, et al.
Published: (2025)
by: Adamczyk, Jacob, et al.
Published: (2025)
Wasserstein Barycenter Soft Actor-Critic
by: Shahrooei, Zahra, et al.
Published: (2025)
by: Shahrooei, Zahra, et al.
Published: (2025)
PAC-Bayesian Soft Actor-Critic Learning
by: Tasdighi, Bahareh, et al.
Published: (2023)
by: Tasdighi, Bahareh, et al.
Published: (2023)
Distributional Soft Actor-Critic with Three Refinements
by: Duan, Jingliang, et al.
Published: (2023)
by: Duan, Jingliang, et al.
Published: (2023)
Learning Without Time-Based Embodiment Resets in Soft-Actor Critic
by: Farrahi, Homayoon, et al.
Published: (2025)
by: Farrahi, Homayoon, et al.
Published: (2025)
Broad Critic Deep Actor Reinforcement Learning for Continuous Control
by: Thalagala, Shiron, et al.
Published: (2024)
by: Thalagala, Shiron, et al.
Published: (2024)
S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
by: Messaoud, Safa, et al.
Published: (2024)
by: Messaoud, Safa, et al.
Published: (2024)
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
by: Ma, Xiaoteng, et al.
Published: (2020)
by: Ma, Xiaoteng, et al.
Published: (2020)
SACn: Soft Actor-Critic with n-step Returns
by: Łyskawa, Jakub, et al.
Published: (2025)
by: Łyskawa, Jakub, et al.
Published: (2025)
Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic
by: Vo, Thanh Vinh, et al.
Published: (2025)
by: Vo, Thanh Vinh, et al.
Published: (2025)
ISAACS: Iterative Soft Adversarial Actor-Critic for Safety
by: Hsu, Kai-Chieh, et al.
Published: (2022)
by: Hsu, Kai-Chieh, et al.
Published: (2022)
Chunking the Critic: A Transformer-based Soft Actor-Critic with N-Step Returns
by: Tian, Dong, et al.
Published: (2025)
by: Tian, Dong, et al.
Published: (2025)
SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling
by: Gaven, Loris, et al.
Published: (2024)
by: Gaven, Loris, et al.
Published: (2024)
Actor-Critic without Actor
by: Ki, Donghyeon, et al.
Published: (2025)
by: Ki, Donghyeon, et al.
Published: (2025)
Soft Actor-Critic-based Control Barrier Adaptation for Robust Autonomous Navigation in Unknown Environments
by: Mohammad, Nicholas, et al.
Published: (2025)
by: Mohammad, Nicholas, et al.
Published: (2025)
Reinforcement Learning Position Control of a Quadrotor Using Soft Actor-Critic (SAC)
by: Mahran, Youssef, et al.
Published: (2025)
by: Mahran, Youssef, et al.
Published: (2025)
Dissecting Discrete Soft Actor-Critic: Limitations and Principled Alternatives
by: Asad, Reza, et al.
Published: (2025)
by: Asad, Reza, et al.
Published: (2025)
Bidirectional Soft Actor-Critic: Leveraging Forward and Reverse KL Divergence for Efficient Reinforcement Learning
by: Zhang, Yixian, et al.
Published: (2025)
by: Zhang, Yixian, et al.
Published: (2025)
Generative Actor Critic
by: Qin, Aoyang, et al.
Published: (2025)
by: Qin, Aoyang, et al.
Published: (2025)
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
by: Neo, Dexter, et al.
Published: (2023)
by: Neo, Dexter, et al.
Published: (2023)
AI Olympics challenge with Evolutionary Soft Actor Critic
by: Calì, Marco, et al.
Published: (2024)
by: Calì, Marco, et al.
Published: (2024)
Improving Actor-Critic Training with Steerable Action-Value Approximation Errors
by: Tasdighi, Bahareh, et al.
Published: (2024)
by: Tasdighi, Bahareh, et al.
Published: (2024)
Monte Carlo Beam Search for Actor-Critic Reinforcement Learning in Continuous Control
by: Alzorgan, Hazim, et al.
Published: (2025)
by: Alzorgan, Hazim, et al.
Published: (2025)
Policy-Based Radiative Transfer: Solving the $2$-Level Atom Non-LTE Problem using Soft Actor-Critic Reinforcement Learning
by: Panos, Brandon, et al.
Published: (2025)
by: Panos, Brandon, et al.
Published: (2025)
Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control
by: Woywood, Zeno, et al.
Published: (2024)
by: Woywood, Zeno, et al.
Published: (2024)
Actor-Critic Reinforcement Learning with Phased Actor
by: Wu, Ruofan, et al.
Published: (2024)
by: Wu, Ruofan, et al.
Published: (2024)
Actor-Critic or Critic-Actor? A Tale of Two Time Scales
by: Bhatnagar, Shalabh, et al.
Published: (2022)
by: Bhatnagar, Shalabh, et al.
Published: (2022)
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
by: Zhang, Lunjun, et al.
Published: (2025)
by: Zhang, Lunjun, et al.
Published: (2025)
Actor-Critic Physics-informed Neural Lyapunov Control
by: Wang, Jiarui, et al.
Published: (2024)
by: Wang, Jiarui, et al.
Published: (2024)
Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios
by: Zhang, Feihong, et al.
Published: (2025)
by: Zhang, Feihong, et al.
Published: (2025)
Similar Items
-
OMoE: Diversifying Mixture of Low-Rank Adaptation by Orthogonal Finetuning
by: Feng, Jinyuan, et al.
Published: (2025) -
Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning
by: Ma, Hao, et al.
Published: (2025) -
Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner
by: Ma, Hao, et al.
Published: (2026) -
Rethinking Soft Actor-Critic in High-Dimensional Action Spaces: The Cost of Ignoring Distribution Shift
by: Chen, Yanjun, et al.
Published: (2024) -
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
by: Ishfaq, Haque, et al.
Published: (2025)