:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Song, Bowen, Gros, Sebastien, Iannelli, Andrea
Format:	Preprint
Published:	2025
Subjects:	Systems and Control
Online Access:	https://arxiv.org/abs/2512.03764
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Convergence Guarantees of Model-free Policy Gradient Methods for LQR with Stochastic Data
by: Song, Bowen, et al.
Published: (2025)

A Stochastic Gradient Descent Approach to Design Policy Gradient Methods for LQR
by: Song, Bowen, et al.
Published: (2026)

Robustness of Online Identification-based Policy Iteration to Noisy Data
by: Song, Bowen, et al.
Published: (2025)

Convergence and Robustness of Value and Policy Iteration for the Linear Quadratic Regulator
by: Song, Bowen, et al.
Published: (2024)

The Role of Identification in Data-driven Policy Iteration: A System Theoretic Study
by: Song, Bowen, et al.
Published: (2024)

Towards Safe Reinforcement Learning Using NMPC and Policy Gradients: Part I - Stochastic case
by: Gros, Sebastien, et al.
Published: (2019)

On Globally Optimal Stochastic Policy Gradient Methods for Domain Randomized LQR Synthesis
by: Nguyen-Le, Alex, et al.
Published: (2026)

Towards Safe Reinforcement Learning Using NMPC and Policy Gradients: Part II - Deterministic Case
by: Gros, Sebastien, et al.
Published: (2019)

Sample Complexity Bounds for Linear System Identification from a Finite Set
by: Chatzikiriakos, Nicolas, et al.
Published: (2024)

Policy Gradient Bounds in Multitask LQR
by: Stamouli, Charis, et al.
Published: (2025)

Hidden Convexity in Active Learning: A Convexified Online Input Design for ARX Systems
by: Chatzikiriakos, Nicolas, et al.
Published: (2025)

Policy Gradient Methods for the Cost-Constrained LQR: Strong Duality and Global Convergence
by: Zhao, Feiran, et al.
Published: (2024)

Policy Gradient for LQR with Domain Randomization
by: Fujinami, Tesshu, et al.
Published: (2025)

Safe Reinforcement Learning Using Robust MPC
by: Zanon, Mario, et al.
Published: (2019)

Adaptive control mechanisms in gradient descent algorithms
by: Iannelli, Andrea
Published: (2025)

Policy Gradient Adaptive Control for the LQR: Indirect and Direct Approaches
by: Zhao, Feiran, et al.
Published: (2025)

Stability-Certified On-Policy Data-Driven LQR via Recursive Learning and Policy Gradient
by: Sforni, Lorenzo, et al.
Published: (2024)

Derivative-Free Data-Driven Control of Continuous-Time Linear Time-Invariant Systems
by: Bosso, Alessandro, et al.
Published: (2024)

LQR for Systems with Probabilistic Parametric Uncertainties: A Gradient Method
by: Cui, Leilei, et al.
Published: (2026)

Stochastic LQR Design With Disturbance Preview
by: Liu, Jietian, et al.
Published: (2024)

Beyond Bounded Noise: Stochastic Set-Membership Estimation for Nonlinear Systems
by: Brändle, Felix, et al.
Published: (2026)

Model-Free Output Feedback Stabilization via Policy Gradient Methods
by: Zhang, Ankang, et al.
Published: (2026)

High Effort, Low Gain: Fundamental Limits of Active Learning for Linear Dynamical Systems
by: Chatzikiriakos, Nicolas, et al.
Published: (2025)

Revisiting LQR Control from the Perspective of Receding-Horizon Policy Gradient
by: Zhang, Xiangyuan, et al.
Published: (2023)

Accelerated ADMM: Automated Parameter Tuning and Improved Linear Convergence
by: Tavakoli, Meisam, et al.
Published: (2025)

A hybrid systems framework for data-based adaptive control of linear time-varying systems
by: Iannelli, Andrea, et al.
Published: (2024)

Online Convex Optimization and Integral Quadratic Constraints: An automated approach to regret analysis
by: Jakob, Fabian, et al.
Published: (2025)

Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)

The Distributionally Robust Infinite-Horizon LQR
by: Hajar, Joudi, et al.
Published: (2024)

Robustness of Iteratively Pre-Conditioned Gradient-Descent Method: The Case of Distributed Linear Regression Problem
by: Chakrabarti, Kushal, et al.
Published: (2021)

Data-Driven Stabilization of Continuous-Time LTI Systems from Noisy Input-Output Data
by: Bosso, Alessandro, et al.
Published: (2025)

Mixed Regular and Impulsive Sampled-data LQR
by: Daafouz, Jamal, et al.
Published: (2024)

CORL: Reinforcement Learning of MILP Policies Solved via Branch and Bound
by: Anand, Akhil S, et al.
Published: (2025)

Personalized Dynamic Pricing Policy for Electric Vehicles: Reinforcement learning approach
by: Bae, Sangjun, et al.
Published: (2024)

Convergence Analysis of Gradient Flow for Overparameterized LQR Formulations
by: de Oliveira, Arthur Castello B., et al.
Published: (2024)

A Linear Parameter-Varying Framework for the Analysis of Time-Varying Optimization Algorithms
by: Jakob, Fabian, et al.
Published: (2025)

Cost-Matching Model Predictive Control for Efficient Reinforcement Learning in Humanoid Locomotion
by: Cai, Wenqi, et al.
Published: (2026)

MR-ARL: Model Reference Adaptive Reinforcement Learning for Robustly Stable On-Policy Data-Driven LQR
by: Borghesi, Marco, et al.
Published: (2024)

Quasi-Newton Compatible Actor-Critic for Deterministic Policies
by: Kordabad, Arash Bahari, et al.
Published: (2025)

Data-Driven LQR with Finite-Time Experiments via Extremum-Seeking Policy Iteration
by: Carnevale, Guido, et al.
Published: (2024)