| Main Authors: | Schmitt-Förster, Peter; Sutter, Tobias |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.02201 |
Similar Items
Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
by: Kamoutsi, Angeliki, et al.
Published: (2024)
A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance
by: Wolter, Axel Friedrich, et al.
Published: (2025)
Distributional Adversarial Attacks and Training in Deep Hedging
by: He, Guangyi, et al.
Published: (2025)
Sample Complexity of Variance-reduced Distributionally Robust Q-learning
by: Wang, Shengbo, et al.
Published: (2023)
Central Limit Theorems for Asynchronous Averaged Q-Learning
by: Liu, Xingtu
Published: (2025)
Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets
by: Li, Mengmeng, et al.
Published: (2023)
Robust Regression over Averaged Uncertainty
by: Bertsimas, Dimitris, et al.
Published: (2023)
Towards Optimal Offline Reinforcement Learning
by: Li, Mengmeng, et al.
Published: (2025)
On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes
by: Wan, Yi, et al.
Published: (2024)
Regularization for Adversarial Robust Learning
by: Wang, Jie, et al.
Published: (2024)
Sample Complexity of Distributionally Robust Average-Reward Reinforcement Learning
by: Chen, Zijun, et al.
Published: (2025)
Robust Q-Learning under Corrupted Rewards
by: Maity, Sreejeet, et al.
Published: (2024)
Regularized Adaptive Momentum Dual Averaging with an Efficient Inexact Subproblem Solver for Training Structured Neural Network
by: Huang, Zih-Syuan, et al.
Published: (2024)
Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)
Robust Implicit Regularization via Weight Normalization
by: Chou, Hung-Hsu, et al.
Published: (2023)
Non-Rectangular Average-Reward Robust MDPs: Optimal Policies and Their Transient Values
by: Wang, Shengbo, et al.
Published: (2026)
Variance-Reduced Cascade Q-learning: Algorithms and Sample Complexity
by: Boveiri, Mohammad, et al.
Published: (2024)
On Generalization and Regularization via Wasserstein Distributionally Robust Optimization
by: Wu, Qinyu, et al.
Published: (2022)
Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
by: Zhang, Yixuan, et al.
Published: (2024)
Bellman Optimality of Average-Reward Robust Markov Decision Processes with a Constant Gain
by: Wang, Shengbo, et al.
Published: (2025)
Optimal Sample Complexity for Average Reward Markov Decision Processes
by: Wang, Shengbo, et al.
Published: (2023)
Implicit Regularization Makes Overparameterized Asymmetric Matrix Sensing Robust to Perturbations
by: Wind, Johan S.
Published: (2023)
Online reinforcement learning via sparse Gaussian mixture model Q-functions
by: Vu, Minh, et al.
Published: (2025)
Robust $Q$-learning Algorithm for Markov Decision Processes under Wasserstein Uncertainty
by: Neufeld, Ariel, et al.
Published: (2022)
Robust Fitted-Q-Evaluation and Iteration under Sequentially Exogenous Unobserved Confounders
by: Bruns-Smith, David, et al.
Published: (2023)
Nested Stochastic Algorithm for Generalized Sinkhorn distance-Regularized Distributionally Robust Optimization
by: Yang, Yufeng, et al.
Published: (2025)
Distributionally Robust Deep Q-Learning
by: Lu, Chung I, et al.
Published: (2025)
DADA: Dual Averaging with Distance Adaptation
by: Moshtaghifar, Mohammad, et al.
Published: (2025)
Q3R: Quadratic Reweighted Rank Regularizer for Effective Low-Rank Training
by: Ghosh, Ipsita, et al.
Published: (2025)
A Unified Analysis for Finite Weight Averaging
by: Wang, Peng, et al.
Published: (2024)
Planning and Learning in Average Risk-aware MDPs
by: Wang, Weikai, et al.
Published: (2025)
Refined Analysis of Federated Averaging and Federated Richardson-Romberg
by: Mangold, Paul, et al.
Published: (2024)
Composite Optimization with Error Feedback: the Dual Averaging Approach
by: Gao, Yuan, et al.
Published: (2025)
Layer-wise Quantization for Quantized Optimistic Dual Averaging
by: Nguyen, Anh Duc, et al.
Published: (2025)
Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
by: Boone, Victor, et al.
Published: (2024)
(Almost) Smooth Sailing: Towards Numerical Stability of Neural Networks Through Differentiable Regularization of the Condition Number
by: Nenov, Rossen, et al.
Published: (2024)
Cauchy-Schwarz Regularizers
by: Taner, Sueda, et al.
Published: (2025)
Performance of NPG in Countable State-Space Average-Cost RL
by: Murthy, Yashaswini, et al.
Published: (2024)
A Simplified Analysis of SGD for Linear Regression with Weight Averaging
by: Meterez, Alexandru, et al.
Published: (2025)
Unified Convergence Analysis for Adaptive Optimization with Moving Average Estimator
by: Guo, Zhishuai, et al.
Published: (2021)