Saved in:
| Main Authors: | Vu, Minh, Slavakis, Konstantinos |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.14585 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Gaussian-Mixture-Model Q-Functions for Policy Iteration in Reinforcement Learning
by: Vu, Minh, et al.
Published: (2025)
by: Vu, Minh, et al.
Published: (2025)
Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimization
by: Vu, Minh, et al.
Published: (2024)
by: Vu, Minh, et al.
Published: (2024)
Iteratively reweighted kernel machines efficiently learn sparse functions
by: Zhu, Libin, et al.
Published: (2025)
by: Zhu, Libin, et al.
Published: (2025)
Symmetric Linear Dynamical Systems are Learnable from Few Observations
by: Vu, Minh, et al.
Published: (2025)
by: Vu, Minh, et al.
Published: (2025)
Continuous-time reinforcement learning: ellipticity enables model-free value function approximation
by: Mou, Wenlong
Published: (2026)
by: Mou, Wenlong
Published: (2026)
What is the objective of reasoning with reinforcement learning?
by: Davis, Damek, et al.
Published: (2025)
by: Davis, Damek, et al.
Published: (2025)
Meta-reinforcement learning with minimum attention
by: Gupta, Shashank, et al.
Published: (2025)
by: Gupta, Shashank, et al.
Published: (2025)
Local linear convergence of gradient methods for overparameterized Gaussian mixtures
by: Wang, Jingxing, et al.
Published: (2026)
by: Wang, Jingxing, et al.
Published: (2026)
Fast sparse optimization via adaptive shrinkage
by: Cerone, Vito, et al.
Published: (2025)
by: Cerone, Vito, et al.
Published: (2025)
Stochastic Halpern iteration in normed spaces and applications to reinforcement learning
by: Bravo, Mario, et al.
Published: (2024)
by: Bravo, Mario, et al.
Published: (2024)
Regularized Q-learning through Robust Averaging
by: Schmitt-Förster, Peter, et al.
Published: (2024)
by: Schmitt-Förster, Peter, et al.
Published: (2024)
Flowsheet synthesis through hierarchical reinforcement learning and graph neural networks
by: Stops, Laura, et al.
Published: (2022)
by: Stops, Laura, et al.
Published: (2022)
Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
by: Kamoutsi, Angeliki, et al.
Published: (2024)
by: Kamoutsi, Angeliki, et al.
Published: (2024)
Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)
by: Manenti, Massimiliano, et al.
Published: (2025)
A reinforcement learning agent for maintenance of deteriorating systems with increasingly imperfect repairs
by: Marugán, Alberto Pliego, et al.
Published: (2025)
by: Marugán, Alberto Pliego, et al.
Published: (2025)
Variance-Reduced Cascade Q-learning: Algorithms and Sample Complexity
by: Boveiri, Mohammad, et al.
Published: (2024)
by: Boveiri, Mohammad, et al.
Published: (2024)
Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
by: Zhang, Yixuan, et al.
Published: (2024)
by: Zhang, Yixuan, et al.
Published: (2024)
Sample Complexity of Variance-reduced Distributionally Robust Q-learning
by: Wang, Shengbo, et al.
Published: (2023)
by: Wang, Shengbo, et al.
Published: (2023)
Safe learning-based control via function-based uncertainty quantification
by: Tokmak, Abdullah, et al.
Published: (2026)
by: Tokmak, Abdullah, et al.
Published: (2026)
Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
by: Yang, Yan, et al.
Published: (2024)
by: Yang, Yan, et al.
Published: (2024)
Independent policy gradient-based reinforcement learning for economic and reliable energy management of multi-microgrid systems
by: Hu, Junkai, et al.
Published: (2025)
by: Hu, Junkai, et al.
Published: (2025)
Adaptive multi-gradient methods for quasiconvex vector optimization and applications to multi-task learning
by: Minh, Nguyen Anh, et al.
Published: (2024)
by: Minh, Nguyen Anh, et al.
Published: (2024)
Scalable spectral representations for multi-agent reinforcement learning in network MDPs
by: Ren, Zhaolin, et al.
Published: (2024)
by: Ren, Zhaolin, et al.
Published: (2024)
Exploiting inter-agent coupling information for efficient reinforcement learning of cooperative LQR
by: Syed, Shahbaz P Qadri, et al.
Published: (2025)
by: Syed, Shahbaz P Qadri, et al.
Published: (2025)
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
by: Duan, Yaqi, et al.
Published: (2024)
by: Duan, Yaqi, et al.
Published: (2024)
Online Nonstochastic Prediction: Logarithmic Regret via Predictive Online Least Squares
by: Pai, Chih-Fan, et al.
Published: (2026)
by: Pai, Chih-Fan, et al.
Published: (2026)
Continuous-time reinforcement learning for optimal switching over multiple regimes
by: Huang, Yijie, et al.
Published: (2025)
by: Huang, Yijie, et al.
Published: (2025)
Dynamic financial processes identification using sparse regressive reservoir computers
by: Vides, Fredy, et al.
Published: (2023)
by: Vides, Fredy, et al.
Published: (2023)
Scalable Online Exploration via Coverability
by: Amortila, Philip, et al.
Published: (2024)
by: Amortila, Philip, et al.
Published: (2024)
Random Sparse Lifts: Construction, Analysis and Convergence of finite sparse networks
by: Robin, David A. R., et al.
Published: (2025)
by: Robin, David A. R., et al.
Published: (2025)
Optimistic Online LQR via Intrinsic Rewards
by: Bartos, Marcell, et al.
Published: (2026)
by: Bartos, Marcell, et al.
Published: (2026)
Nonparametric Bellman Mappings for Reinforcement Learning: Application to Robust Adaptive Filtering
by: Akiyama, Yuki, et al.
Published: (2024)
by: Akiyama, Yuki, et al.
Published: (2024)
Structured Difference-of-Q via Orthogonal Learning
by: Cao, Defu, et al.
Published: (2024)
by: Cao, Defu, et al.
Published: (2024)
Robust Gaussian Processes via Relevance Pursuit
by: Ament, Sebastian, et al.
Published: (2024)
by: Ament, Sebastian, et al.
Published: (2024)
Gaussian Process Thompson Sampling via Rootfinding
by: Adebiyi, Taiwo A., et al.
Published: (2024)
by: Adebiyi, Taiwo A., et al.
Published: (2024)
Stabilizing reinforcement learning control: A modular framework for optimizing over all stable behavior
by: Lawrence, Nathan P., et al.
Published: (2023)
by: Lawrence, Nathan P., et al.
Published: (2023)
A robust and adaptive MPC formulation for Gaussian process models
by: Dubied, Mathieu, et al.
Published: (2025)
by: Dubied, Mathieu, et al.
Published: (2025)
Decision-Focused Federated Learning Under Heterogeneous Objectives and Constraints
by: Ziliaskopoulos, Konstantinos, et al.
Published: (2026)
by: Ziliaskopoulos, Konstantinos, et al.
Published: (2026)
Alignment of large language models with constrained learning
by: Zhang, Botong, et al.
Published: (2025)
by: Zhang, Botong, et al.
Published: (2025)
On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation
by: García, Joaquín Sánchez
Published: (2024)
by: García, Joaquín Sánchez
Published: (2024)
Similar Items
-
Gaussian-Mixture-Model Q-Functions for Policy Iteration in Reinforcement Learning
by: Vu, Minh, et al.
Published: (2025) -
Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimization
by: Vu, Minh, et al.
Published: (2024) -
Iteratively reweighted kernel machines efficiently learn sparse functions
by: Zhu, Libin, et al.
Published: (2025) -
Symmetric Linear Dynamical Systems are Learnable from Few Observations
by: Vu, Minh, et al.
Published: (2025) -
Continuous-time reinforcement learning: ellipticity enables model-free value function approximation
by: Mou, Wenlong
Published: (2026)