:: Library Catalog

Bibliographic Details
Main Authors:	Schmitt-Förster, Peter; Sutter, Tobias
Format:	Preprint
Published:	2024
Subjects:	Optimization and Control; Machine Learning
Online Access:	https://arxiv.org/abs/2405.02201

Similar Items

Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
by: Kamoutsi, Angeliki, et al.
Published: (2024)

A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance
by: Wolter, Axel Friedrich, et al.
Published: (2025)

Distributional Adversarial Attacks and Training in Deep Hedging
by: He, Guangyi, et al.
Published: (2025)

Sample Complexity of Variance-reduced Distributionally Robust Q-learning
by: Wang, Shengbo, et al.
Published: (2023)

Central Limit Theorems for Asynchronous Averaged Q-Learning
by: Liu, Xingtu
Published: (2025)

Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets
by: Li, Mengmeng, et al.
Published: (2023)

Robust Regression over Averaged Uncertainty
by: Bertsimas, Dimitris, et al.
Published: (2023)

Towards Optimal Offline Reinforcement Learning
by: Li, Mengmeng, et al.
Published: (2025)

On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes
by: Wan, Yi, et al.
Published: (2024)

Regularization for Adversarial Robust Learning
by: Wang, Jie, et al.
Published: (2024)

Sample Complexity of Distributionally Robust Average-Reward Reinforcement Learning
by: Chen, Zijun, et al.
Published: (2025)

Robust Q-Learning under Corrupted Rewards
by: Maity, Sreejeet, et al.
Published: (2024)

Regularized Adaptive Momentum Dual Averaging with an Efficient Inexact Subproblem Solver for Training Structured Neural Network
by: Huang, Zih-Syuan, et al.
Published: (2024)

Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)

Robust Implicit Regularization via Weight Normalization
by: Chou, Hung-Hsu, et al.
Published: (2023)

Non-Rectangular Average-Reward Robust MDPs: Optimal Policies and Their Transient Values
by: Wang, Shengbo, et al.
Published: (2026)

Variance-Reduced Cascade Q-learning: Algorithms and Sample Complexity
by: Boveiri, Mohammad, et al.
Published: (2024)

On Generalization and Regularization via Wasserstein Distributionally Robust Optimization
by: Wu, Qinyu, et al.
Published: (2022)

Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
by: Zhang, Yixuan, et al.
Published: (2024)

Bellman Optimality of Average-Reward Robust Markov Decision Processes with a Constant Gain
by: Wang, Shengbo, et al.
Published: (2025)

Optimal Sample Complexity for Average Reward Markov Decision Processes
by: Wang, Shengbo, et al.
Published: (2023)

Implicit Regularization Makes Overparameterized Asymmetric Matrix Sensing Robust to Perturbations
by: Wind, Johan S.
Published: (2023)

Online reinforcement learning via sparse Gaussian mixture model Q-functions
by: Vu, Minh, et al.
Published: (2025)

Robust $Q$-learning Algorithm for Markov Decision Processes under Wasserstein Uncertainty
by: Neufeld, Ariel, et al.
Published: (2022)

Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
by: Bruns-Smith, David, et al.
Published: (2023)

Nested Stochastic Algorithm for Generalized Sinkhorn distance-Regularized Distributionally Robust Optimization
by: Yang, Yufeng, et al.
Published: (2025)

Distributionally Robust Deep Q-Learning
by: Lu, Chung I, et al.
Published: (2025)

DADA: Dual Averaging with Distance Adaptation
by: Moshtaghifar, Mohammad, et al.
Published: (2025)

Q3R: Quadratic Reweighted Rank Regularizer for Effective Low-Rank Training
by: Ghosh, Ipsita, et al.
Published: (2025)

A Unified Analysis for Finite Weight Averaging
by: Wang, Peng, et al.
Published: (2024)

Planning and Learning in Average Risk-aware MDPs
by: Wang, Weikai, et al.
Published: (2025)

Refined Analysis of Federated Averaging and Federated Richardson-Romberg
by: Mangold, Paul, et al.
Published: (2024)

Composite Optimization with Error Feedback: the Dual Averaging Approach
by: Gao, Yuan, et al.
Published: (2025)

Layer-wise Quantization for Quantized Optimistic Dual Averaging
by: Nguyen, Anh Duc, et al.
Published: (2025)

Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
by: Boone, Victor, et al.
Published: (2024)

(Almost) Smooth Sailing: Towards Numerical Stability of Neural Networks Through Differentiable Regularization of the Condition Number
by: Nenov, Rossen, et al.
Published: (2024)

Cauchy-Schwarz Regularizers
by: Taner, Sueda, et al.
Published: (2025)

Performance of NPG in Countable State-Space Average-Cost RL
by: Murthy, Yashaswini, et al.
Published: (2024)

A Simplified Analysis of SGD for Linear Regression with Weight Averaging
by: Meterez, Alexandru, et al.
Published: (2025)

Unified Convergence Analysis for Adaptive Optimization with Moving Average Estimator
by: Guo, Zhishuai, et al.
Published: (2021)