| Main Authors: | Song, Bowen, Gros, Sebastien, Iannelli, Andrea |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.03764 |
Similar Items
Convergence Guarantees of Model-free Policy Gradient Methods for LQR with Stochastic Data
by: Song, Bowen, et al.
Published: (2025)
A Stochastic Gradient Descent Approach to Design Policy Gradient Methods for LQR
by: Song, Bowen, et al.
Published: (2026)
Robustness of Online Identification-based Policy Iteration to Noisy Data
by: Song, Bowen, et al.
Published: (2025)
Convergence and Robustness of Value and Policy Iteration for the Linear Quadratic Regulator
by: Song, Bowen, et al.
Published: (2024)
The Role of Identification in Data-driven Policy Iteration: A System Theoretic Study
by: Song, Bowen, et al.
Published: (2024)
Towards Safe Reinforcement Learning Using NMPC and Policy Gradients: Part I - Stochastic case
by: Gros, Sebastien, et al.
Published: (2019)
On Globally Optimal Stochastic Policy Gradient Methods for Domain Randomized LQR Synthesis
by: Nguyen-Le, Alex, et al.
Published: (2026)
Towards Safe Reinforcement Learning Using NMPC and Policy Gradients: Part II - Deterministic Case
by: Gros, Sebastien, et al.
Published: (2019)
Sample Complexity Bounds for Linear System Identification from a Finite Set
by: Chatzikiriakos, Nicolas, et al.
Published: (2024)
Policy Gradient Bounds in Multitask LQR
by: Stamouli, Charis, et al.
Published: (2025)
Hidden Convexity in Active Learning: A Convexified Online Input Design for ARX Systems
by: Chatzikiriakos, Nicolas, et al.
Published: (2025)
Policy Gradient Methods for the Cost-Constrained LQR: Strong Duality and Global Convergence
by: Zhao, Feiran, et al.
Published: (2024)
Policy Gradient for LQR with Domain Randomization
by: Fujinami, Tesshu, et al.
Published: (2025)
Safe Reinforcement Learning Using Robust MPC
by: Zanon, Mario, et al.
Published: (2019)
Adaptive control mechanisms in gradient descent algorithms
by: Iannelli, Andrea
Published: (2025)
Policy Gradient Adaptive Control for the LQR: Indirect and Direct Approaches
by: Zhao, Feiran, et al.
Published: (2025)
Stability-Certified On-Policy Data-Driven LQR via Recursive Learning and Policy Gradient
by: Sforni, Lorenzo, et al.
Published: (2024)
Derivative-Free Data-Driven Control of Continuous-Time Linear Time-Invariant Systems
by: Bosso, Alessandro, et al.
Published: (2024)
LQR for Systems with Probabilistic Parametric Uncertainties: A Gradient Method
by: Cui, Leilei, et al.
Published: (2026)
Stochastic LQR Design With Disturbance Preview
by: Liu, Jietian, et al.
Published: (2024)
Beyond Bounded Noise: Stochastic Set-Membership Estimation for Nonlinear Systems
by: Brändle, Felix, et al.
Published: (2026)
Model-Free Output Feedback Stabilization via Policy Gradient Methods
by: Zhang, Ankang, et al.
Published: (2026)
High Effort, Low Gain: Fundamental Limits of Active Learning for Linear Dynamical Systems
by: Chatzikiriakos, Nicolas, et al.
Published: (2025)
Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient
by: Zhang, Xiangyuan, et al.
Published: (2023)
Accelerated ADMM: Automated Parameter Tuning and Improved Linear Convergence
by: Tavakoli, Meisam, et al.
Published: (2025)
A hybrid systems framework for data-based adaptive control of linear time-varying systems
by: Iannelli, Andrea, et al.
Published: (2024)
Online Convex Optimization and Integral Quadratic Constraints: An automated approach to regret analysis
by: Jakob, Fabian, et al.
Published: (2025)
Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)
The Distributionally Robust Infinite-Horizon LQR
by: Hajar, Joudi, et al.
Published: (2024)
Robustness of Iteratively Pre-Conditioned Gradient-Descent Method: The Case of Distributed Linear Regression Problem
by: Chakrabarti, Kushal, et al.
Published: (2021)
Data-Driven Stabilization of Continuous-Time LTI Systems from Noisy Input-Output Data
by: Bosso, Alessandro, et al.
Published: (2025)
Mixed Regular and Impulsive Sampled-data LQR
by: Daafouz, Jamal, et al.
Published: (2024)
CORL: Reinforcement Learning of MILP Policies Solved via Branch and Bound
by: Anand, Akhil S, et al.
Published: (2025)
Personalized Dynamic Pricing Policy for Electric Vehicles: Reinforcement learning approach
by: Bae, Sangjun, et al.
Published: (2024)
Convergence Analysis of Gradient Flow for Overparameterized LQR Formulations
by: de Oliveira, Arthur Castello B., et al.
Published: (2024)
A Linear Parameter-Varying Framework for the Analysis of Time-Varying Optimization Algorithms
by: Jakob, Fabian, et al.
Published: (2025)
Cost-Matching Model Predictive Control for Efficient Reinforcement Learning in Humanoid Locomotion
by: Cai, Wenqi, et al.
Published: (2026)
MR-ARL: Model Reference Adaptive Reinforcement Learning for Robustly Stable On-Policy Data-Driven LQR
by: Borghesi, Marco, et al.
Published: (2024)
Quasi-Newton Compatible Actor-Critic for Deterministic Policies
by: Kordabad, Arash Bahari, et al.
Published: (2025)
Data-Driven LQR with Finite-Time Experiments via Extremum-Seeking Policy Iteration
by: Carnevale, Guido, et al.
Published: (2024)