| Main Authors: | Shi, Ruochuan, Lu, Runyu, Zhu, Yuanheng, Zhao, Dongbin |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.08412 |
Similar Items
R2PS: Worst-Case Robust Real-Time Pursuit Strategies under Partial Observability
by: Lu, Runyu, et al.
Published: (2025)
Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games
by: Lu, Runyu, et al.
Published: (2025)
Discretizing Continuous Action Space with Unimodal Probability Distributions for On-Policy Reinforcement Learning
by: Zhu, Yuanyang, et al.
Published: (2024)
ISAACS: Iterative Soft Adversarial Actor-Critic for Safety
by: Hsu, Kai-Chieh, et al.
Published: (2022)
Revisiting Discrete Soft Actor-Critic
by: Zhou, Haibin, et al.
Published: (2022)
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
by: Wang, Zhi, et al.
Published: (2024)
Safe Langevin Soft Actor Critic
by: Keswani, Mahesh, et al.
Published: (2026)
DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy
by: Xu, Kaixuan, et al.
Published: (2025)
$π$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data
by: Zhang, Yaocheng, et al.
Published: (2026)
Average-Reward Soft Actor-Critic
by: Adamczyk, Jacob, et al.
Published: (2025)
Wasserstein Barycenter Soft Actor-Critic
by: Shahrooei, Zahra, et al.
Published: (2025)
PAC-Bayesian Soft Actor-Critic Learning
by: Tasdighi, Bahareh, et al.
Published: (2023)
Refined Analysis of Entropy-Regularized Actor-Critic
by: Labbi, Safwan, et al.
Published: (2026)
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game
by: Hu, Guangzheng, et al.
Published: (2024)
RLAE: Reinforcement Learning-Assisted Ensemble for LLMs
by: Fu, Yuqian, et al.
Published: (2025)
Distributional Soft Actor-Critic with Diffusion Policy
by: Liu, Tong, et al.
Published: (2025)
Multi-Agent Soft Actor-Critic with Coordinated Loss for Autonomous Mobility-on-Demand Fleet Control
by: Woywood, Zeno, et al.
Published: (2024)
Distributional Soft Actor-Critic with Three Refinements
by: Duan, Jingliang, et al.
Published: (2023)
Generative Actor-Critic with Soft Bridge Policies
by: He, Ke, et al.
Published: (2026)
Cooperative Multi-Agent Graph Bandits: UCB Algorithm and Regret Analysis
by: Paschalidis, Phevos, et al.
Published: (2024)
Adaptive Ensemble Aggregation for Actor-Critics
by: Werge, Nicklas, et al.
Published: (2025)
Scalable Neighborhood-Based Multi-Agent Actor-Critic
by: Goppelsroeder, Tim, et al.
Published: (2026)
Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes
by: Fu, Yuqian, et al.
Published: (2026)
DSAC: Distributional Soft Actor-Critic for Risk-Sensitive Reinforcement Learning
by: Ma, Xiaoteng, et al.
Published: (2020)
SACn: Soft Actor-Critic with n-step Returns
by: Łyskawa, Jakub, et al.
Published: (2025)
Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic
by: Vo, Thanh Vinh, et al.
Published: (2025)
Revisiting Mixture Policies in Entropy-Regularized Actor-Critic
by: He, Jiamin, et al.
Published: (2026)
Generative Actor Critic
by: Qin, Aoyang, et al.
Published: (2025)
Chunking the Critic: A Transformer-based Soft Actor-Critic with N-Step Returns
by: Tian, Dong, et al.
Published: (2025)
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
by: Ishfaq, Haque, et al.
Published: (2025)
Actor-Critic without Actor
by: Ki, Donghyeon, et al.
Published: (2025)
Learning Without Time-Based Embodiment Resets in Soft-Actor Critic
by: Farrahi, Homayoon, et al.
Published: (2025)
Dissecting Discrete Soft Actor-Critic: Limitations and Principled Alternatives
by: Asad, Reza, et al.
Published: (2025)
Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning
by: Küçükoğlu, Burcu, et al.
Published: (2025)
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning
by: Wei, Honghao, et al.
Published: (2024)
ACSAC: Adaptive Chunk Size Actor-Critic with Causal Transformer Q-Network
by: Chen, Qian, et al.
Published: (2026)
S$^2$AC: Energy-Based Reinforcement Learning with Stein Soft Actor Critic
by: Messaoud, Safa, et al.
Published: (2024)
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
by: Neo, Dexter, et al.
Published: (2023)
AI Olympics challenge with Evolutionary Soft Actor Critic
by: Calì, Marco, et al.
Published: (2024)
Actor-Critic Reinforcement Learning with Phased Actor
by: Wu, Ruofan, et al.
Published: (2024)