Saved in:
| Main Author: | Kobayashi, Taisuke |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.01613 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward
by: Kobayashi, Taisuke
Published: (2023)
by: Kobayashi, Taisuke
Published: (2023)
Towards Autonomous Driving of Personal Mobility with Small and Noisy Dataset using Tsallis-statistics-based Behavioral Cloning
by: Kobayashi, Taisuke, et al.
Published: (2021)
by: Kobayashi, Taisuke, et al.
Published: (2021)
Weber-Fechner Law in Temporal Difference learning derived from Control as Inference
by: Takahashi, Keiichiro, et al.
Published: (2024)
by: Takahashi, Keiichiro, et al.
Published: (2024)
Flexible Empowerment at Reasoning with Extended Best-of-N Sampling
by: Kobayashi, Taisuke
Published: (2026)
by: Kobayashi, Taisuke
Published: (2026)
Revisiting Experience Replayable Conditions
by: Kobayashi, Taisuke
Published: (2024)
by: Kobayashi, Taisuke
Published: (2024)
Improvements of Dark Experience Replay and Reservoir Sampling towards Better Balance between Consolidation and Plasticity
by: Kobayashi, Taisuke
Published: (2025)
by: Kobayashi, Taisuke
Published: (2025)
DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning
by: Kobayashi, Taisuke
Published: (2024)
by: Kobayashi, Taisuke
Published: (2024)
Consolidated Adaptive T-soft Update for Deep Reinforcement Learning
by: Kobayashi, Taisuke
Published: (2022)
by: Kobayashi, Taisuke
Published: (2022)
CubeDAgger: Interactive Imitation Learning for Dynamic Systems with Efficient yet Low-risk Interaction
by: Kobayashi, Taisuke
Published: (2025)
by: Kobayashi, Taisuke
Published: (2025)
Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs
by: Satheesh, Anirudh, et al.
Published: (2025)
by: Satheesh, Anirudh, et al.
Published: (2025)
Value Improved Actor Critic Algorithms
by: Oren, Yaniv, et al.
Published: (2024)
by: Oren, Yaniv, et al.
Published: (2024)
Compatible Gradient Approximations for Actor-Critic Algorithms
by: Saglam, Baturay, et al.
Published: (2024)
by: Saglam, Baturay, et al.
Published: (2024)
Finite-Time Analysis of Three-Timescale Constrained Actor-Critic and Constrained Natural Actor-Critic Algorithms
by: Panda, Prashansa, et al.
Published: (2023)
by: Panda, Prashansa, et al.
Published: (2023)
Variational Adaptive Noise and Dropout towards Stable Recurrent Neural Networks
by: Kobayashi, Taisuke, et al.
Published: (2025)
by: Kobayashi, Taisuke, et al.
Published: (2025)
Design of Restricted Normalizing Flow towards Arbitrary Stochastic Policy with Computational Efficiency
by: Kobayashi, Taisuke, et al.
Published: (2024)
by: Kobayashi, Taisuke, et al.
Published: (2024)
Actor-Critic without Actor
by: Ki, Donghyeon, et al.
Published: (2025)
by: Ki, Donghyeon, et al.
Published: (2025)
Actor-Critic Algorithm for Dynamic Expectile and CVaR
by: Luo, Yudong, et al.
Published: (2026)
by: Luo, Yudong, et al.
Published: (2026)
A Theoretical Justification for Asymmetric Actor-Critic Algorithms
by: Lambrechts, Gaspard, et al.
Published: (2025)
by: Lambrechts, Gaspard, et al.
Published: (2025)
Improving Actor-Critic Training with Steerable Action-Value Approximation Errors
by: Tasdighi, Bahareh, et al.
Published: (2024)
by: Tasdighi, Bahareh, et al.
Published: (2024)
An Investigation of Batch Normalization in Off-Policy Actor-Critic Algorithms
by: Wang, Li, et al.
Published: (2025)
by: Wang, Li, et al.
Published: (2025)
A Communication-Efficient Decentralized Actor-Critic Algorithm
by: Ren, Xiaoxing, et al.
Published: (2025)
by: Ren, Xiaoxing, et al.
Published: (2025)
Actor-Critic Reinforcement Learning with Phased Actor
by: Wu, Ruofan, et al.
Published: (2024)
by: Wu, Ruofan, et al.
Published: (2024)
Actor-Critic or Critic-Actor? A Tale of Two Time Scales
by: Bhatnagar, Shalabh, et al.
Published: (2022)
by: Bhatnagar, Shalabh, et al.
Published: (2022)
D2 Actor Critic: Diffusion Actor Meets Distributional Critic
by: Zhang, Lunjun, et al.
Published: (2025)
by: Zhang, Lunjun, et al.
Published: (2025)
Double Actor-Critic with TD Error-Driven Regularization in Reinforcement Learning
by: Chen, Haohui, et al.
Published: (2024)
by: Chen, Haohui, et al.
Published: (2024)
Generative Actor Critic
by: Qin, Aoyang, et al.
Published: (2025)
by: Qin, Aoyang, et al.
Published: (2025)
Noisy Spiking Actor Network for Exploration
by: Chen, Ding, et al.
Published: (2024)
by: Chen, Ding, et al.
Published: (2024)
Adviser-Actor-Critic: Eliminating Steady-State Error in Reinforcement Learning Control
by: Chen, Donghe, et al.
Published: (2025)
by: Chen, Donghe, et al.
Published: (2025)
Weak Convergence Analysis of Online Neural Actor-Critic Algorithms
by: Lam, Samuel Chun-Hei, et al.
Published: (2024)
by: Lam, Samuel Chun-Hei, et al.
Published: (2024)
Limits of Actor-Critic Algorithms for Decision Tree Policies Learning in IBMDPs
by: Kohler, Hector, et al.
Published: (2023)
by: Kohler, Hector, et al.
Published: (2023)
Scaling Effects and Uncertainty Quantification in Neural Actor Critic Algorithms
by: Georgoudios, Nikos, et al.
Published: (2026)
by: Georgoudios, Nikos, et al.
Published: (2026)
DSAC-C: Constrained Maximum Entropy for Robust Discrete Soft-Actor Critic
by: Neo, Dexter, et al.
Published: (2023)
by: Neo, Dexter, et al.
Published: (2023)
A New Error Temporal Difference Algorithm for Deep Reinforcement Learning in Microgrid Optimization
by: Yao, Fulong, et al.
Published: (2025)
by: Yao, Fulong, et al.
Published: (2025)
Stochastic Actor-Critic: Mitigating Overestimation via Temporal Aleatoric Uncertainty
by: Özalp, Uğurcan
Published: (2026)
by: Özalp, Uğurcan
Published: (2026)
SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer
by: de Lara, Nathan Samuel, et al.
Published: (2026)
by: de Lara, Nathan Samuel, et al.
Published: (2026)
XQCfD: Accelerating Fast Actor-Critic Algorithms with Prior Data and Prior Policies
by: Palenicek, Daniel, et al.
Published: (2026)
by: Palenicek, Daniel, et al.
Published: (2026)
An Actor-Critic Algorithm with Function Approximation for Risk Sensitive Cost Markov Decision Processes
by: Guin, Soumyajit, et al.
Published: (2025)
by: Guin, Soumyajit, et al.
Published: (2025)
Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm
by: Rozanov, Nikolai
Published: (2024)
by: Rozanov, Nikolai
Published: (2024)
Novel Actor-Critic Algorithm for Robust Decision Making of CAV under Delays and Loss of V2X Data
by: Kherroubi, Zine el abidine
Published: (2024)
by: Kherroubi, Zine el abidine
Published: (2024)
Risk-Sensitive Exponential Actor Critic
by: Granados, Alonso, et al.
Published: (2026)
by: Granados, Alonso, et al.
Published: (2026)
Similar Items
-
Intentionally-underestimated Value Function at Terminal State for Temporal-difference Learning with Mis-designed Reward
by: Kobayashi, Taisuke
Published: (2023) -
Towards Autonomous Driving of Personal Mobility with Small and Noisy Dataset using Tsallis-statistics-based Behavioral Cloning
by: Kobayashi, Taisuke, et al.
Published: (2021) -
Weber-Fechner Law in Temporal Difference learning derived from Control as Inference
by: Takahashi, Keiichiro, et al.
Published: (2024) -
Flexible Empowerment at Reasoning with Extended Best-of-N Sampling
by: Kobayashi, Taisuke
Published: (2026) -
Revisiting Experience Replayable Conditions
by: Kobayashi, Taisuke
Published: (2024)