:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lim, Han-Dong, Lee, Donghwan
Format:	Preprint
Published:	2022
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2202.05404
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Periodic Regularized Q-Learning
by: Yang, Hyukjun, et al.
Published: (2026)

A finite time analysis of distributed Q-learning
by: Lim, Han-Dong, et al.
Published: (2024)

Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
by: Lim, Han-Dong, et al.
Published: (2025)

A Switching System Theory of Q-Learning with Linear Function Approximation
by: Lee, Donghwan, et al.
Published: (2026)

A primal-dual perspective for distributed TD-learning
by: Lim, Han-Dong, et al.
Published: (2023)

Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
by: Lim, Han-Dong, et al.
Published: (2024)

Analysis of Off-Policy $n$-Step TD-Learning with Linear Function Approximation
by: Lim, Han-Dong, et al.
Published: (2025)

Finite-Time Analysis of Temporal Difference Learning with Experience Replay
by: Lim, Han-Dong, et al.
Published: (2023)

Backstepping Temporal Difference Learning
by: Lim, Han-Dong, et al.
Published: (2023)

Finite-Time Analysis of Simultaneous Double Q-learning
by: Na, Hyunjun, et al.
Published: (2024)

Safe-Support Q-Learning: Learning without Unsafe Exploration
by: Lim, Yeeun, et al.
Published: (2026)

Contraction-Aligned Analysis of Soft Bellman Residual Minimization with Weighted Lp-Norm for Markov Decision Problem
by: Yang, Hyukjun, et al.
Published: (2026)

Lyapunov-Certified Direct Switching Theory for Q-Learning
by: Lee, Donghwan
Published: (2026)

New Versions of Gradient Temporal Difference Learning
by: Lee, Donghwan, et al.
Published: (2021)

Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
by: Jeong, Narim, et al.
Published: (2024)

Suppressing Overestimation in Q-Learning through Adversarial Behaviors
by: Lee, HyeAnn, et al.
Published: (2023)

Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games
by: Jeong, Narim, et al.
Published: (2026)

Toward a Unified Lyapunov-Certified ODE Convergence Analysis of Smooth Q-Learning with p-Norms
by: Lee, Donghwan, et al.
Published: (2024)

Deep Q-Learning with Gradient Target Tracking
by: Park, Bum Geun, et al.
Published: (2025)

Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning
by: Park, Jongchan, et al.
Published: (2025)

Analysis of Off-Policy Multi-Step TD-Learning with Linear Function Approximation
by: Lee, Donghwan
Published: (2024)

Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives
by: Lee, Taeho, et al.
Published: (2026)

Soft Deterministic Policy Gradient with Gaussian Smoothing
by: Na, Hyunjun, et al.
Published: (2026)

R-GTD: A Geometric Analysis of Gradient Temporal-Difference Learning in Singular Regimes
by: Na, Hyunjun, et al.
Published: (2026)

Adaptive Policy Backbone via Shared Network
by: Park, Bumgeun, et al.
Published: (2025)

Bellman Residual Minimization for Control: Geometry, Stationarity, and Convergence
by: Lee, Donghwan, et al.
Published: (2026)

Regularized Q-learning through Robust Averaging
by: Schmitt-Förster, Peter, et al.
Published: (2024)

Finite-Time Accuracy of Temporal-Difference Learning Under Schur-Stable Recursions
by: Lee, Donghwan, et al.
Published: (2022)

Studying the Korean Word-Chain Game with RLVR: Mitigating Reward Conflicts via Curriculum Learning
by: Rho, Donghwan
Published: (2025)

Exclusively Penalized Q-learning for Offline Reinforcement Learning
by: Yeom, Junghyuk, et al.
Published: (2024)

A Theory of Non-Linear Feature Learning with One Gradient Step in Two-Layer Neural Networks
by: Moniri, Behrad, et al.
Published: (2023)

TILDE-Q: A Transformation Invariant Loss Function for Time-Series Forecasting
by: Lee, Hyunwook, et al.
Published: (2022)

MahaVar: OOD Detection via Class-wise Mahalanobis Distance Variance under Neural Collapse
by: Kim, Donghwan, et al.
Published: (2026)

Mitigating the Likelihood Paradox in Flow-based OOD Detection via Entropy Manipulation
by: Kim, Donghwan, et al.
Published: (2026)

Merge and Bound: Direct Manipulations on Weights for Class Incremental Learning
by: Kim, Taehoon, et al.
Published: (2025)

Transfer learning via Regularized Linear Discriminant Analysis
by: Zhang, Hongzhe, et al.
Published: (2025)

Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning
by: Yan, Teng, et al.
Published: (2024)

HyperQ-Opt: Q-learning for Hyperparameter Optimization
by: Hasan, Md. Tarek
Published: (2024)

Q-value Regularized Transformer for Offline Reinforcement Learning
by: Hu, Shengchao, et al.
Published: (2024)

Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model
by: Zhang, Jing, et al.
Published: (2024)