Saved in:
| Main Authors: | Lee, Donghwan, Lim, Han-Dong, Park, Jihoon, Choi, Okyong |
|---|---|
| Format: | Preprint |
| Published: |
2021
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2109.04033 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Backstepping Temporal Difference Learning
by: Lim, Han-Dong, et al.
Published: (2023)
by: Lim, Han-Dong, et al.
Published: (2023)
Finite-Time Analysis of Temporal Difference Learning with Experience Replay
by: Lim, Han-Dong, et al.
Published: (2023)
by: Lim, Han-Dong, et al.
Published: (2023)
R-GTD: A Geometric Analysis of Gradient Temporal-Difference Learning in Singular Regimes
by: Na, Hyunjun, et al.
Published: (2026)
by: Na, Hyunjun, et al.
Published: (2026)
A Switching System Theory of Q-Learning with Linear Function Approximation
by: Lee, Donghwan, et al.
Published: (2026)
by: Lee, Donghwan, et al.
Published: (2026)
Analysis of Off-Policy $n$-Step TD-Learning with Linear Function Approximation
by: Lim, Han-Dong, et al.
Published: (2025)
by: Lim, Han-Dong, et al.
Published: (2025)
Regularized Q-learning
by: Lim, Han-Dong, et al.
Published: (2022)
by: Lim, Han-Dong, et al.
Published: (2022)
Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
by: Lim, Han-Dong, et al.
Published: (2025)
by: Lim, Han-Dong, et al.
Published: (2025)
A primal-dual perspective for distributed TD-learning
by: Lim, Han-Dong, et al.
Published: (2023)
by: Lim, Han-Dong, et al.
Published: (2023)
Periodic Regularized Q-Learning
by: Yang, Hyukjun, et al.
Published: (2026)
by: Yang, Hyukjun, et al.
Published: (2026)
A finite time analysis of distributed Q-learning
by: Lim, Han-Dong, et al.
Published: (2024)
by: Lim, Han-Dong, et al.
Published: (2024)
Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
by: Lim, Han-Dong, et al.
Published: (2024)
by: Lim, Han-Dong, et al.
Published: (2024)
Deep Q-Learning with Gradient Target Tracking
by: Park, Bum Geun, et al.
Published: (2025)
by: Park, Bum Geun, et al.
Published: (2025)
Finite-Time Accuracy of Temporal-Difference Learning Under Schur-Stable Recursions
by: Lee, Donghwan, et al.
Published: (2022)
by: Lee, Donghwan, et al.
Published: (2022)
Contraction-Aligned Analysis of Soft Bellman Residual Minimization with Weighted Lp-Norm for Markov Decision Problem
by: Yang, Hyukjun, et al.
Published: (2026)
by: Yang, Hyukjun, et al.
Published: (2026)
Safe-Support Q-Learning: Learning without Unsafe Exploration
by: Lim, Yeeun, et al.
Published: (2026)
by: Lim, Yeeun, et al.
Published: (2026)
Soft Deterministic Policy Gradient with Gaussian Smoothing
by: Na, Hyunjun, et al.
Published: (2026)
by: Na, Hyunjun, et al.
Published: (2026)
Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives
by: Lee, Taeho, et al.
Published: (2026)
by: Lee, Taeho, et al.
Published: (2026)
Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning
by: Park, Jongchan, et al.
Published: (2025)
by: Park, Jongchan, et al.
Published: (2025)
Adaptive Policy Backbone via Shared Network
by: Park, Bumgeun, et al.
Published: (2025)
by: Park, Bumgeun, et al.
Published: (2025)
Analysis of Off-Policy Multi-Step TD-Learning with Linear Function Approximation
by: Lee, Donghwan
Published: (2024)
by: Lee, Donghwan
Published: (2024)
Lyapunov-Certified Direct Switching Theory for Q-Learning
by: Lee, Donghwan
Published: (2026)
by: Lee, Donghwan
Published: (2026)
A Theory of Non-Linear Feature Learning with One Gradient Step in Two-Layer Neural Networks
by: Moniri, Behrad, et al.
Published: (2023)
by: Moniri, Behrad, et al.
Published: (2023)
Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
by: Jeong, Narim, et al.
Published: (2024)
by: Jeong, Narim, et al.
Published: (2024)
Suppressing Overestimation in Q-Learning through Adversarial Behaviors
by: Lee, HyeAnn, et al.
Published: (2023)
by: Lee, HyeAnn, et al.
Published: (2023)
Gradient Iterated Temporal-Difference Learning
by: Vincent, Théo, et al.
Published: (2026)
by: Vincent, Théo, et al.
Published: (2026)
Toward a Unified Lyapunov-Certified ODE Convergence Analysis of Smooth Q-Learning with p-Norms
by: Lee, Donghwan, et al.
Published: (2024)
by: Lee, Donghwan, et al.
Published: (2024)
Studying the Korean Word-Chain Game with RLVR: Mitigating Reward Conflicts via Curriculum Learning
by: Rho, Donghwan
Published: (2025)
by: Rho, Donghwan
Published: (2025)
Revisiting a Design Choice in Gradient Temporal Difference Learning
by: Qian, Xiaochi, et al.
Published: (2023)
by: Qian, Xiaochi, et al.
Published: (2023)
Finite-Time Analysis of Simultaneous Double Q-learning
by: Na, Hyunjun, et al.
Published: (2024)
by: Na, Hyunjun, et al.
Published: (2024)
Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games
by: Jeong, Narim, et al.
Published: (2026)
by: Jeong, Narim, et al.
Published: (2026)
Bellman Residual Minimization for Control: Geometry, Stationarity, and Convergence
by: Lee, Donghwan, et al.
Published: (2026)
by: Lee, Donghwan, et al.
Published: (2026)
Merge and Bound: Direct Manipulations on Weights for Class Incremental Learning
by: Kim, Taehoon, et al.
Published: (2025)
by: Kim, Taehoon, et al.
Published: (2025)
Stochastic Optimal Control for Diffusion Bridges in Function Spaces
by: Park, Byoungwoo, et al.
Published: (2024)
by: Park, Byoungwoo, et al.
Published: (2024)
Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
by: Kim, Seyeon, et al.
Published: (2024)
by: Kim, Seyeon, et al.
Published: (2024)
Spatio-Temporal Forecasting of Retaining Wall Deformation: Mitigating Error Accumulation via Multi-Resolution ConvLSTM Stacking Ensemble
by: Kim, Jihoon, et al.
Published: (2026)
by: Kim, Jihoon, et al.
Published: (2026)
Temporal Graph Learning Recurrent Neural Network for Traffic Forecasting
by: Lee, Sanghyun, et al.
Published: (2024)
by: Lee, Sanghyun, et al.
Published: (2024)
Temporal Linear Item-Item Model for Sequential Recommendation
by: Park, Seongmin, et al.
Published: (2024)
by: Park, Seongmin, et al.
Published: (2024)
Controllable Machine Unlearning via Gradient Pivoting
by: Hwang, Youngsik, et al.
Published: (2025)
by: Hwang, Youngsik, et al.
Published: (2025)
Material-Agnostic Zero-Shot Thermal Inference for Metal Additive Manufacturing via a Parametric PINN Framework
by: Lee, Hyeonsu, et al.
Published: (2026)
by: Lee, Hyeonsu, et al.
Published: (2026)
Beyond Correctness: Learning Robust Reasoning via Transfer
by: Lee, Hyunseok, et al.
Published: (2026)
by: Lee, Hyunseok, et al.
Published: (2026)
Similar Items
-
Backstepping Temporal Difference Learning
by: Lim, Han-Dong, et al.
Published: (2023) -
Finite-Time Analysis of Temporal Difference Learning with Experience Replay
by: Lim, Han-Dong, et al.
Published: (2023) -
R-GTD: A Geometric Analysis of Gradient Temporal-Difference Learning in Singular Regimes
by: Na, Hyunjun, et al.
Published: (2026) -
A Switching System Theory of Q-Learning with Linear Function Approximation
by: Lee, Donghwan, et al.
Published: (2026) -
Analysis of Off-Policy $n$-Step TD-Learning with Linear Function Approximation
by: Lim, Han-Dong, et al.
Published: (2025)