Saved in:
| Main Authors: | Shi, Chongyang, Han, Shuo, Dorothy, Michael, Fu, Jie |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.16439 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Policy Gradient Methods for Information-Theoretic Opacity in Markov Decision Processes
by: Shi, Chongyang, et al.
Published: (2025)
by: Shi, Chongyang, et al.
Published: (2025)
Integrated Control and Active Perception in POMDPs for Temporal Logic Tasks and Information Acquisition
by: Shi, Chongyang, et al.
Published: (2025)
by: Shi, Chongyang, et al.
Published: (2025)
IMAS$^2$: Joint Agent Selection and Information-Theoretic Coordinated Perception In Dec-POMDPs
by: Shi, Chongyang, et al.
Published: (2025)
by: Shi, Chongyang, et al.
Published: (2025)
C-IDS: Solving Contextual POMDP via Information-Directed Objective
by: Shi, Chongyang, et al.
Published: (2026)
by: Shi, Chongyang, et al.
Published: (2026)
Active Inference through Incentive Design in Markov Decision Processes
by: Wei, Xinyi, et al.
Published: (2025)
by: Wei, Xinyi, et al.
Published: (2025)
Information-Theoretic Opacity-Enforcement in Markov Decision Processes
by: Shi, Chongyang, et al.
Published: (2024)
by: Shi, Chongyang, et al.
Published: (2024)
Synthesis of Dynamic Masks for Information-Theoretic Opacity in Stochastic Systems
by: Udupa, Sumukha, et al.
Published: (2025)
by: Udupa, Sumukha, et al.
Published: (2025)
Information-Driven Active Perception for k-step Predictive Safety Monitoring
by: Udupa, Sumukha, et al.
Published: (2026)
by: Udupa, Sumukha, et al.
Published: (2026)
DiffOP: Reinforcement Learning of Optimization-Based Control Policies via Implicit Policy Gradients
by: Bian, Yuexin, et al.
Published: (2024)
by: Bian, Yuexin, et al.
Published: (2024)
Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States
by: Razin, Noam, et al.
Published: (2024)
by: Razin, Noam, et al.
Published: (2024)
LQR for Systems with Probabilistic Parametric Uncertainties: A Gradient Method
by: Cui, Leilei, et al.
Published: (2026)
by: Cui, Leilei, et al.
Published: (2026)
A Stochastic Gradient Descent Approach to Design Policy Gradient Methods for LQR
by: Song, Bowen, et al.
Published: (2026)
by: Song, Bowen, et al.
Published: (2026)
A Globally Convergent Policy Gradient Method for Linear Quadratic Gaussian (LQG) Control
by: Sadamoto, Tomonori, et al.
Published: (2023)
by: Sadamoto, Tomonori, et al.
Published: (2023)
Policy Gradient Methods for the Cost-Constrained LQR: Strong Duality and Global Convergence
by: Zhao, Feiran, et al.
Published: (2024)
by: Zhao, Feiran, et al.
Published: (2024)
Policy Gradient Methods for Designing Dynamic Output Feedback Controllers
by: Sadamoto, Tomonori, et al.
Published: (2022)
by: Sadamoto, Tomonori, et al.
Published: (2022)
Second-Order Policy Gradient Methods for the Linear Quadratic Regulator
by: Valaei, Amirreza, et al.
Published: (2025)
by: Valaei, Amirreza, et al.
Published: (2025)
Global Convergence of Policy Gradient Methods for ReLU Controllers in Linear Quadratic Regulation
by: Rodriguez-Gil, Jhojan A., et al.
Published: (2026)
by: Rodriguez-Gil, Jhojan A., et al.
Published: (2026)
Model-Free Output Feedback Stabilization via Policy Gradient Methods
by: Zhang, Ankang, et al.
Published: (2026)
by: Zhang, Ankang, et al.
Published: (2026)
Convergence Guarantees of Model-free Policy Gradient Methods for LQR with Stochastic Data
by: Song, Bowen, et al.
Published: (2025)
by: Song, Bowen, et al.
Published: (2025)
On Globally Optimal Stochastic Policy Gradient Methods for Domain Randomized LQR Synthesis
by: Nguyen-Le, Alex, et al.
Published: (2026)
by: Nguyen-Le, Alex, et al.
Published: (2026)
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
by: Ding, Dongsheng, et al.
Published: (2023)
by: Ding, Dongsheng, et al.
Published: (2023)
Policy Gradient Bounds in Multitask LQR
by: Stamouli, Charis, et al.
Published: (2025)
by: Stamouli, Charis, et al.
Published: (2025)
Planning Stealthy Backdoor Attacks in MDPs with Observation-Based Triggers
by: Wei, Xinyi, et al.
Published: (2025)
by: Wei, Xinyi, et al.
Published: (2025)
Data-Driven Contextual-Aware Uncertainty Set for Robust Dispatch of Power Systems
by: Ruan, Zhaojun, et al.
Published: (2026)
by: Ruan, Zhaojun, et al.
Published: (2026)
Heterogeneous Roles against Assignment Based Policies in Two vs Two Target Defense Game
by: Das, Goutam, et al.
Published: (2024)
by: Das, Goutam, et al.
Published: (2024)
Policy Gradient Method for LQG Control via Input-Output-History Representation: Convergence to $O(ε)$-Stationary Points
by: Sadamoto, Tomonori, et al.
Published: (2025)
by: Sadamoto, Tomonori, et al.
Published: (2025)
Stabilizing Policy Gradient Methods via Reward Profiling
by: Ahmed, Shihab, et al.
Published: (2025)
by: Ahmed, Shihab, et al.
Published: (2025)
Receding-Horizon Policy Gradient for Polytopic Controller Synthesis
by: Shakeri, Shiva, et al.
Published: (2026)
by: Shakeri, Shiva, et al.
Published: (2026)
Efficient Policy Adaptation for Voltage Control Under Unknown Topology Changes
by: Feng, Jie, et al.
Published: (2026)
by: Feng, Jie, et al.
Published: (2026)
Sample-Efficient Model-Free Policy Gradient Methods for Stochastic LQR via Robust Linear Regression
by: Song, Bowen, et al.
Published: (2025)
by: Song, Bowen, et al.
Published: (2025)
A Twin Delayed Deep Deterministic Policy Gradient Algorithm for Autonomous Ground Vehicle Navigation via Digital Twin Perception Awareness
by: Olayemi, Kabirat, et al.
Published: (2024)
by: Olayemi, Kabirat, et al.
Published: (2024)
Policy Gradient Adaptive Control for the LQR: Indirect and Direct Approaches
by: Zhao, Feiran, et al.
Published: (2025)
by: Zhao, Feiran, et al.
Published: (2025)
Policy Optimization with Differentiable MPC: Convergence Analysis under Uncertainty
by: Zuliani, Riccardo, et al.
Published: (2026)
by: Zuliani, Riccardo, et al.
Published: (2026)
Initial State Privacy of Nonlinear Systems on Riemannian Manifolds
by: Liu, Le, et al.
Published: (2025)
by: Liu, Le, et al.
Published: (2025)
Active Learning-Based Input Design for Angle-Only Initial Relative Orbit Determination
by: Xie, Kui, et al.
Published: (2025)
by: Xie, Kui, et al.
Published: (2025)
Variance-Reduced Gradient Estimator for Nonconvex Zeroth-Order Distributed Optimization
by: Mu, Huaiyi, et al.
Published: (2024)
by: Mu, Huaiyi, et al.
Published: (2024)
Relax, Estimate, and Track: a Simple Battery State-of-charge and State-of-health Estimation Method
by: Jiang, Shida, et al.
Published: (2024)
by: Jiang, Shida, et al.
Published: (2024)
Uncertainty-Aware Perception-Based Control for Autonomous Racing
by: Trisovic, Jelena, et al.
Published: (2025)
by: Trisovic, Jelena, et al.
Published: (2025)
Event-triggered Dual Gradient Tracking for Distributed Resource Allocation
by: Xu, Xiayan, et al.
Published: (2025)
by: Xu, Xiayan, et al.
Published: (2025)
Stability-Certified On-Policy Data-Driven LQR via Recursive Learning and Policy Gradient
by: Sforni, Lorenzo, et al.
Published: (2024)
by: Sforni, Lorenzo, et al.
Published: (2024)
Similar Items
-
Policy Gradient Methods for Information-Theoretic Opacity in Markov Decision Processes
by: Shi, Chongyang, et al.
Published: (2025) -
Integrated Control and Active Perception in POMDPs for Temporal Logic Tasks and Information Acquisition
by: Shi, Chongyang, et al.
Published: (2025) -
IMAS$^2$: Joint Agent Selection and Information-Theoretic Coordinated Perception In Dec-POMDPs
by: Shi, Chongyang, et al.
Published: (2025) -
C-IDS: Solving Contextual POMDP via Information-Directed Objective
by: Shi, Chongyang, et al.
Published: (2026) -
Active Inference through Incentive Design in Markov Decision Processes
by: Wei, Xinyi, et al.
Published: (2025)