| Main Authors: | Song, Bowen, Iannelli, Andrea |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.19977 |
Similar Items
A Stochastic Gradient Descent Approach to Design Policy Gradient Methods for LQR
by: Song, Bowen, et al.
Published: (2026)
Sample-Efficient Model-Free Policy Gradient Methods for Stochastic LQR via Robust Linear Regression
by: Song, Bowen, et al.
Published: (2025)
Robustness of Online Identification-based Policy Iteration to Noisy Data
by: Song, Bowen, et al.
Published: (2025)
The Role of Identification in Data-driven Policy Iteration: A System Theoretic Study
by: Song, Bowen, et al.
Published: (2024)
Convergence and Robustness of Value and Policy Iteration for the Linear Quadratic Regulator
by: Song, Bowen, et al.
Published: (2024)
Policy Gradient Methods for the Cost-Constrained LQR: Strong Duality and Global Convergence
by: Zhao, Feiran, et al.
Published: (2024)
On Globally Optimal Stochastic Policy Gradient Methods for Domain Randomized LQR Synthesis
by: Nguyen-Le, Alex, et al.
Published: (2026)
Policy Gradient Bounds in Multitask LQR
by: Stamouli, Charis, et al.
Published: (2025)
Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)
Convergence Analysis of Gradient Flow for Overparameterized LQR Formulations
by: de Oliveira, Arthur Castello B., et al.
Published: (2024)
Hidden Convexity in Active Learning: A Convexified Online Input Design for ARX Systems
by: Chatzikiriakos, Nicolas, et al.
Published: (2025)
Policy Gradient for LQR with Domain Randomization
by: Fujinami, Tesshu, et al.
Published: (2025)
Adaptive control mechanisms in gradient descent algorithms
by: Iannelli, Andrea
Published: (2025)
Stability-Certified On-Policy Data-Driven LQR via Recursive Learning and Policy Gradient
by: Sforni, Lorenzo, et al.
Published: (2024)
Policy Gradient Adaptive Control for the LQR: Indirect and Direct Approaches
by: Zhao, Feiran, et al.
Published: (2025)
On the (almost) Global Exponential Convergence of the Overparameterized Policy Optimization for the LQR Problem
by: Wafi, Moh Kamalul, et al.
Published: (2025)
Data-Driven Stabilization of Continuous-Time LTI Systems from Noisy Input-Output Data
by: Bosso, Alessandro, et al.
Published: (2025)
Accelerated ADMM: Automated Parameter Tuning and Improved Linear Convergence
by: Tavakoli, Meisam, et al.
Published: (2025)
LQR for Systems with Probabilistic Parametric Uncertainties: A Gradient Method
by: Cui, Leilei, et al.
Published: (2026)
Stochastic LQR Design With Disturbance Preview
by: Liu, Jietian, et al.
Published: (2024)
Learning Soft Constrained MPC Value Functions: Efficient MPC Design and Implementation providing Stability and Safety Guarantees
by: Chatzikiriakos, Nicolas, et al.
Published: (2024)
Beyond Bounded Noise: Stochastic Set-Membership Estimation for Nonlinear Systems
by: Brändle, Felix, et al.
Published: (2026)
Data-Enabled Policy Optimization for Direct Adaptive Learning of the LQR
by: Zhao, Feiran, et al.
Published: (2024)
A Policy Gradient Framework for Stochastic Optimal Control Problems with Global Convergence Guarantee
by: Zhou, Mo, et al.
Published: (2023)
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient
by: Zhang, Xiangyuan, et al.
Published: (2023)
A hybrid systems framework for data-based adaptive control of linear time-varying systems
by: Iannelli, Andrea, et al.
Published: (2024)
Online Convex Optimization and Integral Quadratic Constraints: An automated approach to regret analysis
by: Jakob, Fabian, et al.
Published: (2025)
Data-Driven LQR with Finite-Time Experiments via Extremum-Seeking Policy Iteration
by: Carnevale, Guido, et al.
Published: (2024)
Sample Complexity Bounds for Linear System Identification from a Finite Set
by: Chatzikiriakos, Nicolas, et al.
Published: (2024)
Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization
by: Kanakeri, Vinay, et al.
Published: (2025)
The role of identification in data‐driven policy iteration: A system theoretic study
by: Bowen Song, et al.
Published: (2024)
Density-Driven Optimal Control: Convergence Guarantees for Stochastic LTI Multi-Agent Systems
by: Lee, Kooktae
Published: (2026)
A Globally Convergent Policy Gradient Method for Linear Quadratic Gaussian (LQG) Control
by: Sadamoto, Tomonori, et al.
Published: (2023)
Global Convergence of Policy Gradient Methods for ReLU Controllers in Linear Quadratic Regulation
by: Rodriguez-Gil, Jhojan A., et al.
Published: (2026)
Stochastic MPC with Online-optimized Policies and Closed-loop Guarantees
by: Bartos, Marcell, et al.
Published: (2025)
Analyzing the Impact of Computation in Adaptive Dynamic Programming for Stochastic LQR Problem
by: Cao, Wenhan, et al.
Published: (2024)
MR-ARL: Model Reference Adaptive Reinforcement Learning for Robustly Stable On-Policy Data-Driven LQR
by: Borghesi, Marco, et al.
Published: (2024)
A Bayesian Perspective on the Data-Driven LQR
by: Schwaller, Thierry, et al.
Published: (2026)
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
by: Ding, Dongsheng, et al.
Published: (2023)
Derivative-Free Data-Driven Control of Continuous-Time Linear Time-Invariant Systems
by: Bosso, Alessandro, et al.
Published: (2024)