:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lee, Donghwan, Lim, Han-Dong, Park, Jihoon, Choi, Okyong
Format:	Preprint
Published:	2021
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2109.04033
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Backstepping Temporal Difference Learning
by: Lim, Han-Dong, et al.
Published: (2023)

Finite-Time Analysis of Temporal Difference Learning with Experience Replay
by: Lim, Han-Dong, et al.
Published: (2023)

R-GTD: A Geometric Analysis of Gradient Temporal-Difference Learning in Singular Regimes
by: Na, Hyunjun, et al.
Published: (2026)

A Switching System Theory of Q-Learning with Linear Function Approximation
by: Lee, Donghwan, et al.
Published: (2026)

Analysis of Off-Policy $n$-Step TD-Learning with Linear Function Approximation
by: Lim, Han-Dong, et al.
Published: (2025)

Regularized Q-learning
by: Lim, Han-Dong, et al.
Published: (2022)

Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
by: Lim, Han-Dong, et al.
Published: (2025)

A primal-dual perspective for distributed TD-learning
by: Lim, Han-Dong, et al.
Published: (2023)

Periodic Regularized Q-Learning
by: Yang, Hyukjun, et al.
Published: (2026)

A finite time analysis of distributed Q-learning
by: Lim, Han-Dong, et al.
Published: (2024)

Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
by: Lim, Han-Dong, et al.
Published: (2024)

Deep Q-Learning with Gradient Target Tracking
by: Park, Bum Geun, et al.
Published: (2025)

Finite-Time Accuracy of Temporal-Difference Learning Under Schur-Stable Recursions
by: Lee, Donghwan, et al.
Published: (2022)

Contraction-Aligned Analysis of Soft Bellman Residual Minimization with Weighted Lp-Norm for Markov Decision Problem
by: Yang, Hyukjun, et al.
Published: (2026)

Safe-Support Q-Learning: Learning without Unsafe Exploration
by: Lim, Yeeun, et al.
Published: (2026)

Soft Deterministic Policy Gradient with Gaussian Smoothing
by: Na, Hyunjun, et al.
Published: (2026)

Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives
by: Lee, Taeho, et al.
Published: (2026)

Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning
by: Park, Jongchan, et al.
Published: (2025)

Adaptive Policy Backbone via Shared Network
by: Park, Bumgeun, et al.
Published: (2025)

Analysis of Off-Policy Multi-Step TD-Learning with Linear Function Approximation
by: Lee, Donghwan
Published: (2024)

Lyapunov-Certified Direct Switching Theory for Q-Learning
by: Lee, Donghwan
Published: (2026)

A Theory of Non-Linear Feature Learning with One Gradient Step in Two-Layer Neural Networks
by: Moniri, Behrad, et al.
Published: (2023)

Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
by: Jeong, Narim, et al.
Published: (2024)

Suppressing Overestimation in Q-Learning through Adversarial Behaviors
by: Lee, HyeAnn, et al.
Published: (2023)

Gradient Iterated Temporal-Difference Learning
by: Vincent, Théo, et al.
Published: (2026)

Toward a Unified Lyapunov-Certified ODE Convergence Analysis of Smooth Q-Learning with p-Norms
by: Lee, Donghwan, et al.
Published: (2024)

Studying the Korean Word-Chain Game with RLVR: Mitigating Reward Conflicts via Curriculum Learning
by: Rho, Donghwan
Published: (2025)

Revisiting a Design Choice in Gradient Temporal Difference Learning
by: Qian, Xiaochi, et al.
Published: (2023)

Finite-Time Analysis of Simultaneous Double Q-learning
by: Na, Hyunjun, et al.
Published: (2024)

Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games
by: Jeong, Narim, et al.
Published: (2026)

Bellman Residual Minimization for Control: Geometry, Stationarity, and Convergence
by: Lee, Donghwan, et al.
Published: (2026)

Merge and Bound: Direct Manipulations on Weights for Class Incremental Learning
by: Kim, Taehoon, et al.
Published: (2025)

Stochastic Optimal Control for Diffusion Bridges in Function Spaces
by: Park, Byoungwoo, et al.
Published: (2024)

Generalized Gaussian Temporal Difference Error for Uncertainty-aware Reinforcement Learning
by: Kim, Seyeon, et al.
Published: (2024)

Spatio-Temporal Forecasting of Retaining Wall Deformation: Mitigating Error Accumulation via Multi-Resolution ConvLSTM Stacking Ensemble
by: Kim, Jihoon, et al.
Published: (2026)

Temporal Graph Learning Recurrent Neural Network for Traffic Forecasting
by: Lee, Sanghyun, et al.
Published: (2024)

Temporal Linear Item-Item Model for Sequential Recommendation
by: Park, Seongmin, et al.
Published: (2024)

Controllable Machine Unlearning via Gradient Pivoting
by: Hwang, Youngsik, et al.
Published: (2025)

Material-Agnostic Zero-Shot Thermal Inference for Metal Additive Manufacturing via a Parametric PINN Framework
by: Lee, Hyeonsu, et al.
Published: (2026)

Beyond Correctness: Learning Robust Reasoning via Transfer
by: Lee, Hyunseok, et al.
Published: (2026)