:: Library Catalog

Bibliographic Details
Main author:	Lee, Donghwan
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online access:	https://arxiv.org/abs/2605.16103

Similar items

Learning the Model While Learning Q: Finite-Time Sample Complexity of Online SyncMBQ
by: Lim, Han-Dong, et al.
Published: (2024)

Finite-Time Analysis of Temporal Difference Learning with Experience Replay
by: Lim, Han-Dong, et al.
Published: (2023)

Switching-Geometry Analysis of Deflated Q-Value Iteration
by: Lee, Donghwan
Published: (2026)

Lyapunov-Certified Direct Switching Theory for Q-Learning
by: Lee, Donghwan
Published: (2026)

A Discrete-Time Switching System Analysis of Q-learning
by: Lee, Donghwan, et al.
Published: (2021)

Toward a Unified Lyapunov-Certified ODE Convergence Analysis of Smooth Q-Learning with p-Norms
by: Lee, Donghwan, et al.
Published: (2024)

Suppressing Overestimation in Q-Learning through Adversarial Behaviors
by: Lee, HyeAnn, et al.
Published: (2023)

Periodic Regularized Q-Learning
by: Yang, Hyukjun, et al.
Published: (2026)

Finite-Time Error Analysis of Soft Q-Learning: Switching System Approach
by: Jeong, Narim, et al.
Published: (2024)

Safe-Support Q-Learning: Learning without Unsafe Exploration
by: Lim, Yeeun, et al.
Published: (2026)

Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning
by: Park, Jongchan, et al.
Published: (2025)

A finite time analysis of distributed Q-learning
by: Lim, Han-Dong, et al.
Published: (2024)

R-GTD: A Geometric Analysis of Gradient Temporal-Difference Learning in Singular Regimes
by: Na, Hyunjun, et al.
Published: (2026)

Understanding the theoretical properties of projected Bellman equation, linear Q-learning, and approximate value iteration
by: Lim, Han-Dong, et al.
Published: (2025)

Analysis of Off-Policy $n$-Step TD-Learning with Linear Function Approximation
by: Lim, Han-Dong, et al.
Published: (2025)

Finite-Time Analysis of Simultaneous Double Q-learning
by: Na, Hyunjun, et al.
Published: (2024)

Beyond the Bellman Fixed Point: Geometry and Fast Policy Identification in Value Iteration
by: Lee, Donghwan
Published: (2026)

Backstepping Temporal Difference Learning
by: Lim, Han-Dong, et al.
Published: (2023)

Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control
by: Lee, Taeho, et al.
Published: (2025)

Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives
by: Lee, Taeho, et al.
Published: (2026)

Analysis of approximate linear programming solution to Markov decision problem with log barrier function
by: Lee, Donghwan, et al.
Published: (2025)

Soft Deterministic Policy Gradient with Gaussian Smoothing
by: Na, Hyunjun, et al.
Published: (2026)

Adaptive Policy Backbone via Shared Network
by: Park, Bumgeun, et al.
Published: (2025)

Finite-Time Analysis of Q-Value Iteration for General-Sum Stackelberg Games
by: Jeong, Narim, et al.
Published: (2026)

Continuous-Time Distributed Dynamic Programming for Networked Multi-Agent Markov Decision Processes
by: Lee, Donghwan, et al.
Published: (2023)

MahaVar: OOD Detection via Class-wise Mahalanobis Distance Variance under Neural Collapse
by: Kim, Donghwan, et al.
Published: (2026)

Mitigating the Likelihood Paradox in Flow-based OOD Detection via Entropy Manipulation
by: Kim, Donghwan, et al.
Published: (2026)

Mutation-based Consistency Testing for Evaluating the Code Understanding Capability of LLMs
by: Li, Ziyu, et al.
Published: (2024)

Q-Learning under Finite Model Uncertainty
by: Sester, Julian, et al.
Published: (2024)

Inference of Deterministic Finite Automata via Q-Learning
by: Hosseinkhani, Elaheh, et al.
Published: (2025)

Frictional Q-Learning
by: Kim, Hyunwoo, et al.
Published: (2025)

Merge and Bound: Direct Manipulations on Weights for Class Incremental Learning
by: Kim, Taehoon, et al.
Published: (2025)

PhysHanDI: Physics-Based Reconstruction of Hand-Deformable Object Interactions
by: Lee, Jihyun, et al.
Published: (2026)

Why the Counterintuitive Phenomenon of Likelihood Rarely Appears in Tabular Anomaly Detection with Deep Generative Models?
by: Kim, Donghwan, et al.
Published: (2026)

Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning
by: Omura, Motoki, et al.
Published: (2024)

Chunk-Guided Q-Learning
by: Song, Gwanwoo, et al.
Published: (2026)

DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
by: Nam, Hyeongjin, et al.
Published: (2025)

Finite-Time Analysis of MCTS in Continuous POMDP Planning
by: Kong, Da, et al.
Published: (2026)

Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error
by: Du, Ally Yalei, et al.
Published: (2024)

Find A Winning Sign: Sign Is All We Need to Win the Lottery
by: Oh, Junghun, et al.
Published: (2025)