| Main Authors: | Lim, Han-Dong, Lee, Donghwan |
|---|---|
| Format: | Preprint |
| Published: | 2022 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2202.05404 |
Similar Items
Periodic Regularized Q-Learning
by: Yang, Hyukjun, et al.
Published: (2026)
A finite time analysis of distributed Q-learning
by: Lim, Han-Dong, et al.
Published: (2024)
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
by: Lim, Han-Dong, et al.
Published: (2025)
A Switching System Theory of Q-Learning with Linear Function Approximation
by: Lee, Donghwan, et al.
Published: (2026)
A primal-dual perspective for distributed TD-learning
by: Lim, Han-Dong, et al.
Published: (2023)
Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
by: Lim, Han-Dong, et al.
Published: (2024)
Analysis of Off-Policy $n$-Step TD-Learning with Linear Function Approximation
by: Lim, Han-Dong, et al.
Published: (2025)
Finite-Time Analysis of Temporal Difference Learning with Experience Replay
by: Lim, Han-Dong, et al.
Published: (2023)
Backstepping Temporal Difference Learning
by: Lim, Han-Dong, et al.
Published: (2023)
Finite-Time Analysis of Simultaneous Double Q-learning
by: Na, Hyunjun, et al.
Published: (2024)
Safe-Support Q-Learning: Learning without Unsafe Exploration
by: Lim, Yeeun, et al.
Published: (2026)
Contraction-Aligned Analysis of Soft Bellman Residual Minimization with Weighted Lp-Norm for Markov Decision Problem
by: Yang, Hyukjun, et al.
Published: (2026)
Lyapunov-Certified Direct Switching Theory for Q-Learning
by: Lee, Donghwan
Published: (2026)
New Versions of Gradient Temporal Difference Learning
by: Lee, Donghwan, et al.
Published: (2021)
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
by: Jeong, Narim, et al.
Published: (2024)
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
by: Lee, HyeAnn, et al.
Published: (2023)
Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games
by: Jeong, Narim, et al.
Published: (2026)
Toward a Unified Lyapunov-Certified ODE Convergence Analysis of Smooth Q-Learning with p-Norms
by: Lee, Donghwan, et al.
Published: (2024)
Deep Q-Learning with Gradient Target Tracking
by: Park, Bum Geun, et al.
Published: (2025)
Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning
by: Park, Jongchan, et al.
Published: (2025)
Analysis of Off-Policy Multi-Step TD-Learning with Linear Function Approximation
by: Lee, Donghwan
Published: (2024)
Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives
by: Lee, Taeho, et al.
Published: (2026)
Soft Deterministic Policy Gradient with Gaussian Smoothing
by: Na, Hyunjun, et al.
Published: (2026)
R-GTD: A Geometric Analysis of Gradient Temporal-Difference Learning in Singular Regimes
by: Na, Hyunjun, et al.
Published: (2026)
Adaptive Policy Backbone via Shared Network
by: Park, Bumgeun, et al.
Published: (2025)
Bellman Residual Minimization for Control: Geometry, Stationarity, and Convergence
by: Lee, Donghwan, et al.
Published: (2026)
Regularized Q-learning through Robust Averaging
by: Schmitt-Förster, Peter, et al.
Published: (2024)
Finite-Time Accuracy of Temporal-Difference Learning Under Schur-Stable Recursions
by: Lee, Donghwan, et al.
Published: (2022)
Studying the Korean Word-Chain Game with RLVR: Mitigating Reward Conflicts via Curriculum Learning
by: Rho, Donghwan
Published: (2025)
Exclusively Penalized Q-learning for Offline Reinforcement Learning
by: Yeom, Junghyuk, et al.
Published: (2024)
A Theory of Non-Linear Feature Learning with One Gradient Step in Two-Layer Neural Networks
by: Moniri, Behrad, et al.
Published: (2023)
TILDE-Q: A Transformation Invariant Loss Function for Time-Series Forecasting
by: Lee, Hyunwook, et al.
Published: (2022)
MahaVar: OOD Detection via Class-wise Mahalanobis Distance Variance under Neural Collapse
by: Kim, Donghwan, et al.
Published: (2026)
Mitigating the Likelihood Paradox in Flow-based OOD Detection via Entropy Manipulation
by: Kim, Donghwan, et al.
Published: (2026)
Merge and Bound: Direct Manipulations on Weights for Class Incremental Learning
by: Kim, Taehoon, et al.
Published: (2025)
Transfer learning via Regularized Linear Discriminant Analysis
by: Zhang, Hongzhe, et al.
Published: (2025)
Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
by: Yan, Teng, et al.
Published: (2024)
HyperQ-Opt: Q-learning for Hyperparameter Optimization
by: Hasan, Md. Tarek
Published: (2024)
Q-value Regularized Transformer for Offline Reinforcement Learning
by: Hu, Shengchao, et al.
Published: (2024)
Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
by: Zhang, Jing, et al.
Published: (2024)