:: Library Catalog

Bibliographic Details
Main Authors:	Vu, Minh; Slavakis, Konstantinos
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Optimization and Control
Online Access:	https://arxiv.org/abs/2509.14585

Similar Items

Gaussian-Mixture-Model Q-Functions for Policy Iteration in Reinforcement Learning
by: Vu, Minh, et al.
Published: (2025)

Gaussian-Mixture-Model Q-Functions for Reinforcement Learning by Riemannian Optimization
by: Vu, Minh, et al.
Published: (2024)

Iteratively reweighted kernel machines efficiently learn sparse functions
by: Zhu, Libin, et al.
Published: (2025)

Symmetric Linear Dynamical Systems are Learnable from Few Observations
by: Vu, Minh, et al.
Published: (2025)

Continuous-time reinforcement learning: ellipticity enables model-free value function approximation
by: Mou, Wenlong
Published: (2026)

What is the objective of reasoning with reinforcement learning?
by: Davis, Damek, et al.
Published: (2025)

Meta-reinforcement learning with minimum attention
by: Gupta, Shashank, et al.
Published: (2025)

Local linear convergence of gradient methods for overparameterized Gaussian mixtures
by: Wang, Jingxing, et al.
Published: (2026)

Fast sparse optimization via adaptive shrinkage
by: Cerone, Vito, et al.
Published: (2025)

Stochastic Halpern iteration in normed spaces and applications to reinforcement learning
by: Bravo, Mario, et al.
Published: (2024)

Regularized Q-learning through Robust Averaging
by: Schmitt-Förster, Peter, et al.
Published: (2024)

Flowsheet synthesis through hierarchical reinforcement learning and graph neural networks
by: Stops, Laura, et al.
Published: (2022)

Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
by: Kamoutsi, Angeliki, et al.
Published: (2024)

Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)

A reinforcement learning agent for maintenance of deteriorating systems with increasingly imperfect repairs
by: Marugán, Alberto Pliego, et al.
Published: (2025)

Variance-Reduced Cascade Q-learning: Algorithms and Sample Complexity
by: Boveiri, Mohammad, et al.
Published: (2024)

Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
by: Zhang, Yixuan, et al.
Published: (2024)

Sample Complexity of Variance-reduced Distributionally Robust Q-learning
by: Wang, Shengbo, et al.
Published: (2023)

Safe learning-based control via function-based uncertainty quantification
by: Tokmak, Abdullah, et al.
Published: (2026)

Bilevel reinforcement learning via the development of hyper-gradient without lower-level convexity
by: Yang, Yan, et al.
Published: (2024)

Independent policy gradient-based reinforcement learning for economic and reliable energy management of multi-microgrid systems
by: Hu, Junkai, et al.
Published: (2025)

Adaptive multi-gradient methods for quasiconvex vector optimization and applications to multi-task learning
by: Minh, Nguyen Anh, et al.
Published: (2024)

Scalable spectral representations for multi-agent reinforcement learning in network MDPs
by: Ren, Zhaolin, et al.
Published: (2024)

Exploiting inter-agent coupling information for efficient reinforcement learning of cooperative LQR
by: Syed, Shahbaz P Qadri, et al.
Published: (2025)

Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
by: Duan, Yaqi, et al.
Published: (2024)

Online Nonstochastic Prediction: Logarithmic Regret via Predictive Online Least Squares
by: Pai, Chih-Fan, et al.
Published: (2026)

Continuous-time reinforcement learning for optimal switching over multiple regimes
by: Huang, Yijie, et al.
Published: (2025)

Dynamic financial processes identification using sparse regressive reservoir computers
by: Vides, Fredy, et al.
Published: (2023)

Scalable Online Exploration via Coverability
by: Amortila, Philip, et al.
Published: (2024)

Random Sparse Lifts: Construction, Analysis and Convergence of finite sparse networks
by: Robin, David A. R., et al.
Published: (2025)

Optimistic Online LQR via Intrinsic Rewards
by: Bartos, Marcell, et al.
Published: (2026)

Nonparametric Bellman Mappings for Reinforcement Learning: Application to Robust Adaptive Filtering
by: Akiyama, Yuki, et al.
Published: (2024)

Structured Difference-of-Q via Orthogonal Learning
by: Cao, Defu, et al.
Published: (2024)

Robust Gaussian Processes via Relevance Pursuit
by: Ament, Sebastian, et al.
Published: (2024)

Gaussian Process Thompson Sampling via Rootfinding
by: Adebiyi, Taiwo A., et al.
Published: (2024)

Stabilizing reinforcement learning control: A modular framework for optimizing over all stable behavior
by: Lawrence, Nathan P., et al.
Published: (2023)

A robust and adaptive MPC formulation for Gaussian process models
by: Dubied, Mathieu, et al.
Published: (2025)

Decision-Focused Federated Learning Under Heterogeneous Objectives and Constraints
by: Ziliaskopoulos, Konstantinos, et al.
Published: (2026)

Alignment of large language models with constrained learning
by: Zhang, Botong, et al.
Published: (2025)

On the impact of measure pre-conditionings on general parametric ML models and transfer learning via domain adaptation
by: García, Joaquín Sánchez
Published: (2024)