| Main Authors: | Song, Bowen, Weissmann, Simon, Staudigl, Mathias, Iannelli, Andrea |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.18933 |
Similar Items
Convergence Guarantees of Model-free Policy Gradient Methods for LQR with Stochastic Data
by: Song, Bowen, et al.
Published: (2025)
Sample-Efficient Model-Free Policy Gradient Methods for Stochastic LQR via Robust Linear Regression
by: Song, Bowen, et al.
Published: (2025)
The Role of Identification in Data-driven Policy Iteration: A System Theoretic Study
by: Song, Bowen, et al.
Published: (2024)
Robustness of Online Identification-based Policy Iteration to Noisy Data
by: Song, Bowen, et al.
Published: (2025)
On Globally Optimal Stochastic Policy Gradient Methods for Domain Randomized LQR Synthesis
by: Nguyen-Le, Alex, et al.
Published: (2026)
Policy Gradient Bounds in Multitask LQR
by: Stamouli, Charis, et al.
Published: (2025)
Policy Gradient Adaptive Control for the LQR: Indirect and Direct Approaches
by: Zhao, Feiran, et al.
Published: (2025)
Policy Gradient for LQR with Domain Randomization
by: Fujinami, Tesshu, et al.
Published: (2025)
Policy Gradient Methods for the Cost-Constrained LQR: Strong Duality and Global Convergence
by: Zhao, Feiran, et al.
Published: (2024)
Convergence and Robustness of Value and Policy Iteration for the Linear Quadratic Regulator
by: Song, Bowen, et al.
Published: (2024)
Hidden Convexity in Active Learning: A Convexified Online Input Design for ARX Systems
by: Chatzikiriakos, Nicolas, et al.
Published: (2025)
Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods
by: Klein, Sara, et al.
Published: (2023)
Controlling the Flow: Stability and Convergence for Stochastic Gradient Descent with Decaying Regularization
by: Kassing, Sebastian, et al.
Published: (2025)
LQR for Systems with Probabilistic Parametric Uncertainties: A Gradient Method
by: Cui, Leilei, et al.
Published: (2026)
Derivative-free stochastic bilevel optimization for inverse problems
by: Staudigl, Mathias, et al.
Published: (2024)
Stability-Certified On-Policy Data-Driven LQR via Recursive Learning and Policy Gradient
by: Sforni, Lorenzo, et al.
Published: (2024)
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient
by: Zhang, Xiangyuan, et al.
Published: (2023)
Stochastic LQR Design With Disturbance Preview
by: Liu, Jietian, et al.
Published: (2024)
Anytime Acceleration of Gradient Descent
by: Zhang, Zihan, et al.
Published: (2024)
Convergence Analysis of Gradient Flow for Overparameterized LQR Formulations
by: de Oliveira, Arthur Castello B., et al.
Published: (2024)
Stochastic Differential Inclusions driven by Maximal Monotone Operators with empty interiors
by: Garrido, Juan Guillermo, et al.
Published: (2026)
IRKA is a Riemannian Gradient Descent Method
by: Mlinarić, Petar, et al.
Published: (2023)
Adaptive control mechanisms in gradient descent algorithms
by: Iannelli, Andrea
Published: (2025)
Power-Constrained Policy Gradient Methods for LQR
by: Verma, Ashwin, et al.
Published: (2025)
Natural Gradient Descent for Control
by: Esmzad, Ramin, et al.
Published: (2025)
Trajectory-Oriented Control Using Gradient Descent: An Unconventional Approach
by: Esmzad, Ramin, et al.
Published: (2024)
Interpretable Gradient Descent for Kalman Gain
by: Belabbas, M. A., et al.
Published: (2025)
Policy Gradient Methods for Designing Dynamic Output Feedback Controllers
by: Sadamoto, Tomonori, et al.
Published: (2022)
Small-Disturbance Input-to-State Stability of Perturbed Gradient Flows: Applications to LQR Problem
by: Cui, Leilei, et al.
Published: (2023)
An ADRC-Incorporated Stochastic Gradient Descent Algorithm for Latent Factor Analysis
by: Li, Jinli, et al.
Published: (2024)
Input-Output Stability of Gradient Descent: A Discrete-Time Passivity-Based Approach
by: Moalemi, Sepehr, et al.
Published: (2024)
Beyond Bounded Noise: Stochastic Set-Membership Estimation for Nonlinear Systems
by: Brändle, Felix, et al.
Published: (2026)
A hybrid systems framework for data-based adaptive control of linear time-varying systems
by: Iannelli, Andrea, et al.
Published: (2024)
Bridging Continuous-time LQR and Reinforcement Learning via Gradient Flow of the Bellman Error
by: Gießler, Armin, et al.
Published: (2025)
On Convergence of the Iteratively Preconditioned Gradient-Descent (IPG) Observer
by: Chakrabarti, Kushal, et al.
Published: (2024)
Online Convex Optimization and Integral Quadratic Constraints: An automated approach to regret analysis
by: Jakob, Fabian, et al.
Published: (2025)
Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)
Data-Driven Stabilization of Continuous-Time LTI Systems from Noisy Input-Output Data
by: Bosso, Alessandro, et al.
Published: (2025)
Structure Matters: Dynamic Policy Gradient
by: Klein, Sara, et al.
Published: (2024)
Model Predictive Path Integral Control as Preconditioned Gradient Descent
by: Fazlyab, Mahyar, et al.
Published: (2026)