Saved in:
| Main Authors: | Sinha, Amit, Mahajan, Aditya |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.15703 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Periodic agent-state based Q-learning for POMDPs
by: Sinha, Amit, et al.
Published: (2024)
by: Sinha, Amit, et al.
Published: (2024)
Convergence of regularized agent-state-based Q-learning in POMDPs
by: Sinha, Amit, et al.
Published: (2025)
by: Sinha, Amit, et al.
Published: (2025)
Risk-seeking conservative policy iteration with agent-state based policies for Dec-POMDPs with guaranteed convergence
by: Sinha, Amit, et al.
Published: (2026)
by: Sinha, Amit, et al.
Published: (2026)
Model approximation in MDPs with unbounded per-step cost
by: Bozkurt, Berk, et al.
Published: (2024)
by: Bozkurt, Berk, et al.
Published: (2024)
Concentration of Cumulative Reward in Markov Decision Processes
by: Sayedana, Borna, et al.
Published: (2024)
by: Sayedana, Borna, et al.
Published: (2024)
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs
by: Ding, Dongsheng, et al.
Published: (2022)
by: Ding, Dongsheng, et al.
Published: (2022)
Approximate Control for Continuous-Time POMDPs
by: Eich, Yannick, et al.
Published: (2024)
by: Eich, Yannick, et al.
Published: (2024)
Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs
by: DeWeese, Alex, et al.
Published: (2025)
by: DeWeese, Alex, et al.
Published: (2025)
Scalable spectral representations for multi-agent reinforcement learning in network MDPs
by: Ren, Zhaolin, et al.
Published: (2024)
by: Ren, Zhaolin, et al.
Published: (2024)
Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
by: Boone, Victor, et al.
Published: (2024)
by: Boone, Victor, et al.
Published: (2024)
Posterior Sampling-based Online Learning for Episodic POMDPs
by: Tang, Dengwang, et al.
Published: (2023)
by: Tang, Dengwang, et al.
Published: (2023)
Model-Free Learning and Optimal Policy Design in Multi-Agent MDPs Under Probabilistic Agent Dropout
by: Fiscko, Carmel, et al.
Published: (2023)
by: Fiscko, Carmel, et al.
Published: (2023)
Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
by: Ding, Dongsheng, et al.
Published: (2023)
by: Ding, Dongsheng, et al.
Published: (2023)
Transformer-Based Scalable Multi-Agent Reinforcement Learning for Networked Systems with Long-Range Interactions
by: Sinha, Vidur, et al.
Published: (2025)
by: Sinha, Vidur, et al.
Published: (2025)
Operator-Theoretic Foundations and Policy Gradient Methods for General MDPs with Unbounded Costs
by: Gupta, Abhishek, et al.
Published: (2026)
by: Gupta, Abhishek, et al.
Published: (2026)
Probabilistic Safety Guarantee for Stochastic Control Systems Using Average Reward MDPs
by: Omidi, Saber, et al.
Published: (2025)
by: Omidi, Saber, et al.
Published: (2025)
Explainable Representation of Finite-Memory Policies for POMDPs using Decision Trees
by: Azeem, Muqsit, et al.
Published: (2024)
by: Azeem, Muqsit, et al.
Published: (2024)
Machine learning based state observer for discrete time systems evolving on Lie groups
by: Shanbhag, Soham, et al.
Published: (2024)
by: Shanbhag, Soham, et al.
Published: (2024)
Beyond expected value: geometric mean optimization for long-term policy performance in reinforcement learning
by: Sheng, Xinyi, et al.
Published: (2025)
by: Sheng, Xinyi, et al.
Published: (2025)
Observability conditions for neural state-space models with eigenvalues and their roots of unity
by: Gracyk, Andrew
Published: (2025)
by: Gracyk, Andrew
Published: (2025)
Structured state-space models are deep Wiener models
by: Bonassi, Fabio, et al.
Published: (2023)
by: Bonassi, Fabio, et al.
Published: (2023)
Learning based Modelling of Throttleable Engine Dynamics for Lunar Landing Mission
by: Kumar, Suraj, et al.
Published: (2025)
by: Kumar, Suraj, et al.
Published: (2025)
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
by: Kwon, Jeongyeol, et al.
Published: (2024)
by: Kwon, Jeongyeol, et al.
Published: (2024)
Global Search of Optimal Spacecraft Trajectories using Amortization and Deep Generative Models
by: Beeson, Ryne, et al.
Published: (2024)
by: Beeson, Ryne, et al.
Published: (2024)
On zero-shot learning in neural state estimation of power distribution systems
by: Berezin, Aleksandr, et al.
Published: (2024)
by: Berezin, Aleksandr, et al.
Published: (2024)
Independent policy gradient-based reinforcement learning for economic and reliable energy management of multi-microgrid systems
by: Hu, Junkai, et al.
Published: (2025)
by: Hu, Junkai, et al.
Published: (2025)
Efficient Policy Optimization in Robust Constrained MDPs with Iteration Complexity Guarantees
by: Ganguly, Sourav, et al.
Published: (2025)
by: Ganguly, Sourav, et al.
Published: (2025)
A virtual sensor fusion approach for state of charge estimation of lithium-ion cells
by: Previtali, Davide, et al.
Published: (2025)
by: Previtali, Davide, et al.
Published: (2025)
Learning Optimal Control and Dynamical Structure of Global Trajectory Search Problems with Diffusion Models
by: Graebner, Jannik, et al.
Published: (2024)
by: Graebner, Jannik, et al.
Published: (2024)
Scalable and Interpretable Verification of Image-based Neural Network Controllers for Autonomous Vehicles
by: Parameshwaran, Aditya, et al.
Published: (2025)
by: Parameshwaran, Aditya, et al.
Published: (2025)
Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs
by: Gimelfarb, Michael, et al.
Published: (2024)
by: Gimelfarb, Michael, et al.
Published: (2024)
Model order reduction of deep structured state-space models: A system-theoretic approach
by: Forgione, Marco, et al.
Published: (2024)
by: Forgione, Marco, et al.
Published: (2024)
Domain knowledge-guided machine learning framework for state of health estimation in Lithium-ion batteries
by: Lanubile, Andrea, et al.
Published: (2024)
by: Lanubile, Andrea, et al.
Published: (2024)
Sub-optimality bounds for certainty equivalent policies in partially observed systems
by: Bozkurt, Berk, et al.
Published: (2026)
by: Bozkurt, Berk, et al.
Published: (2026)
Transformer based time series prediction of the maximum power point for solar photovoltaic cells
by: Agrawal, Palaash, et al.
Published: (2024)
by: Agrawal, Palaash, et al.
Published: (2024)
FNO$^{\angle θ}$: Extended Fourier neural operator for learning state and optimal control of distributed parameter systems
by: Li, Zhexian, et al.
Published: (2026)
by: Li, Zhexian, et al.
Published: (2026)
Agile Climate-Sensor Design and Calibration Algorithms Using Machine Learning: Experiments From Cape Point
by: Barrett, Travis, et al.
Published: (2025)
by: Barrett, Travis, et al.
Published: (2025)
Taming "data-hungry" reinforcement learning? Stability in continuous state-action spaces
by: Duan, Yaqi, et al.
Published: (2024)
by: Duan, Yaqi, et al.
Published: (2024)
High entropy leads to symmetry equivariant policies in Dec-POMDPs
by: Forkel, Johannes, et al.
Published: (2025)
by: Forkel, Johannes, et al.
Published: (2025)
Statistical Study of Sensor Data and Investigation of ML-based Calibration Algorithms for Inexpensive Sensor Modules: Experiments from Cape Point
by: Barrett, Travis, et al.
Published: (2025)
by: Barrett, Travis, et al.
Published: (2025)
Similar Items
-
Periodic agent-state based Q-learning for POMDPs
by: Sinha, Amit, et al.
Published: (2024) -
Convergence of regularized agent-state-based Q-learning in POMDPs
by: Sinha, Amit, et al.
Published: (2025) -
Risk-seeking conservative policy iteration with agent-state based policies for Dec-POMDPs with guaranteed convergence
by: Sinha, Amit, et al.
Published: (2026) -
Model approximation in MDPs with unbounded per-step cost
by: Bozkurt, Berk, et al.
Published: (2024) -
Concentration of Cumulative Reward in Markov Decision Processes
by: Sayedana, Borna, et al.
Published: (2024)