:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shi, Ruochuan, Lu, Runyu, Zhu, Yuanheng, Zhao, Dongbin
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2511.08412
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

R2PS: Worst-Case Robust Real-Time Pursuit Strategies under Partial Observability
by: Lu, Runyu, et al.
Published: (2025)

Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games
by: Lu, Runyu, et al.
Published: (2025)

Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning
by: Zhu, Yuanyang, et al.
Published: (2024)

ISAACS: Iterative Soft Adversarial Actor-Critic for Safety
by: Hsu, Kai-Chieh, et al.
Published: (2022)

Revisiting Discrete Soft Actor-Critic
by: Zhou, Haibin, et al.
Published: (2022)

Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
by: Wang, Zhi, et al.
Published: (2024)

Safe Langevin Soft Actor Critic
by: Keswani, Mahesh, et al.
Published: (2026)

DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy
by: Xu, Kaixuan, et al.
Published: (2025)

$π$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data
by: Zhang, Yaocheng, et al.
Published: (2026)

Average-Reward Soft Actor-Critic
by: Adamczyk, Jacob, et al.
Published: (2025)

Wasserstein Barycenter Soft Actor-Critic
by: Shahrooei, Zahra, et al.
Published: (2025)

PAC-Bayesian Soft Actor-Critic Learning
by: Tasdighi, Bahareh, et al.
Published: (2023)

Refined Analysis of Entropy-Regularized Actor-Critic
by: Labbi, Safwan, et al.
Published: (2026)

FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game
by: Hu, Guangzheng, et al.
Published: (2024)

RLAE: Reinforcement Learning-Assisted Ensemble for LLMs
by: Fu, Yuqian, et al.
Published: (2025)

Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025)

Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control
by: Woywood, Zeno, et al.
Published: (2024)

Distributional Soft Actor-Critic with Three Refinements
by: Duan, Jingliang, et al.
Published: (2023)

Generative Actor-Critic with Soft Bridge Policies
by: He, Ke, et al.
Published: (2026)

Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis
by: Paschalidis, Phevos, et al.
Published: (2024)

Adaptive Ensemble Aggregation for Actor-Critics
by: Werge, Nicklas, et al.
Published: (2025)

Scalable Neighborhood-Based Multi-Agent Actor-Critic
by: Goppelsroeder, Tim, et al.
Published: (2026)

Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
by: Fu, Yuqian, et al.
Published: (2026)

DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
by: Ma, Xiaoteng, et al.
Published: (2020)

SACn: Soft Actor-Critic with n-step Returns
by: Łyskawa, Jakub, et al.
Published: (2025)

Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic
by: Vo, Thanh Vinh, et al.
Published: (2025)

Revisiting Mixture Policies in Entropy-Regularized Actor-Critic
by: He, Jiamin, et al.
Published: (2026)

Generative Actor Critic
by: Qin, Aoyang, et al.
Published: (2025)

Chunking the Critic: A Transformer-based Soft Actor-Critic with N-Step Returns
by: Tian, Dong, et al.
Published: (2025)

Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
by: Ishfaq, Haque, et al.
Published: (2025)

Actor-Critic without Actor
by: Ki, Donghyeon, et al.
Published: (2025)

Learning Without Time-Based Embodiment Resets in Soft-Actor Critic
by: Farrahi, Homayoon, et al.
Published: (2025)

Dissecting Discrete Soft Actor-Critic: Limitations and Principled Alternatives
by: Asad, Reza, et al.
Published: (2025)

Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning
by: Küçükoğlu, Burcu, et al.
Published: (2025)

Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning
by: Wei, Honghao, et al.
Published: (2024)

ACSAC: Adaptive Chunk Size Actor-Critic with Causal Transformer Q-Network
by: Chen, Qian, et al.
Published: (2026)

S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
by: Messaoud, Safa, et al.
Published: (2024)

DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
by: Neo, Dexter, et al.
Published: (2023)

AI Olympics challenge with Evolutionary Soft Actor Critic
by: Calì, Marco, et al.
Published: (2024)

Actor-Critic Reinforcement Learning with Phased Actor
by: Wu, Ruofan, et al.
Published: (2024)