Saved in:
| Main Authors: | Mehrabi, Mohammad, Wager, Stefan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.08201 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Conformal Off-Policy Evaluation in Markov Decision Processes
by: Foffano, Daniele, et al.
Published: (2023)
by: Foffano, Daniele, et al.
Published: (2023)
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes
by: Bennett, Andrew, et al.
Published: (2024)
by: Bennett, Andrew, et al.
Published: (2024)
Policy Learning with Competing Agents
by: Sahoo, Roshni, et al.
Published: (2022)
by: Sahoo, Roshni, et al.
Published: (2022)
Policy Testing in Markov Decision Processes
by: Ariu, Kaito, et al.
Published: (2025)
by: Ariu, Kaito, et al.
Published: (2025)
Distributional Off-Policy Evaluation with Deep Quantile Process Regression
by: Kuang, Qi, et al.
Published: (2026)
by: Kuang, Qi, et al.
Published: (2026)
Optimal Decision Tree Policies for Markov Decision Processes
by: Vos, Daniël, et al.
Published: (2023)
by: Vos, Daniël, et al.
Published: (2023)
Weakly Time-Coupled Approximation of Markov Decision Processes
by: Soheili, Negar, et al.
Published: (2026)
by: Soheili, Negar, et al.
Published: (2026)
Fair Resource Allocation in Weakly Coupled Markov Decision Processes
by: Tu, Xiaohui, et al.
Published: (2024)
by: Tu, Xiaohui, et al.
Published: (2024)
Policy Regularized Distributionally Robust Markov Decision Processes with Linear Function Approximation
by: Gu, Jingwen, et al.
Published: (2025)
by: Gu, Jingwen, et al.
Published: (2025)
Policy Gradient for Robust Markov Decision Processes
by: Wang, Qiuhao, et al.
Published: (2024)
by: Wang, Qiuhao, et al.
Published: (2024)
Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes
by: Montenegro, Alessandro, et al.
Published: (2025)
by: Montenegro, Alessandro, et al.
Published: (2025)
Markov Decision Processes under External Temporal Processes
by: Ayyagari, Ranga Shaarad, et al.
Published: (2023)
by: Ayyagari, Ranga Shaarad, et al.
Published: (2023)
Initial Distribution Sensitivity of Constrained Markov Decision Processes
by: Tercan, Alperen, et al.
Published: (2025)
by: Tercan, Alperen, et al.
Published: (2025)
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
by: Sherman, Uri, et al.
Published: (2023)
by: Sherman, Uri, et al.
Published: (2023)
Off-Policy Evaluation for Ranking Policies under Deterministic Logging Policies
by: Tanaka, Koichi, et al.
Published: (2026)
by: Tanaka, Koichi, et al.
Published: (2026)
Optimistic Actor-Critic with Parametric Policies for Linear Markov Decision Processes
by: Lin, Max Qiushi, et al.
Published: (2026)
by: Lin, Max Qiushi, et al.
Published: (2026)
Optimal Posterior Sampling for Policy Identification in Tabular Markov Decision Processes
by: Kone, Cyrille, et al.
Published: (2026)
by: Kone, Cyrille, et al.
Published: (2026)
Learning Markov Decision Processes under Fully Bandit Feedback
by: Zhuo, Zhengjia, et al.
Published: (2026)
by: Zhuo, Zhengjia, et al.
Published: (2026)
Flipping-based Policy for Chance-Constrained Markov Decision Processes
by: Shen, Xun, et al.
Published: (2024)
by: Shen, Xun, et al.
Published: (2024)
SPOT: Scalable Policy Optimization with Trees for Markov Decision Processes
by: Xiong, Xuyuan, et al.
Published: (2025)
by: Xiong, Xuyuan, et al.
Published: (2025)
On Convergence of Average-Reward Q-Learning in Weakly Communicating Markov Decision Processes
by: Wan, Yi, et al.
Published: (2024)
by: Wan, Yi, et al.
Published: (2024)
An Orthogonal Learner for Individualized Outcomes in Markov Decision Processes
by: Javurek, Emil, et al.
Published: (2025)
by: Javurek, Emil, et al.
Published: (2025)
Linear Mixture Distributionally Robust Markov Decision Processes
by: Liu, Zhishuai, et al.
Published: (2025)
by: Liu, Zhishuai, et al.
Published: (2025)
Monitored Markov Decision Processes
by: Parisi, Simone, et al.
Published: (2024)
by: Parisi, Simone, et al.
Published: (2024)
Efficient and Sharp Off-Policy Learning under Unobserved Confounding
by: Hess, Konstantin, et al.
Published: (2025)
by: Hess, Konstantin, et al.
Published: (2025)
On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes
by: Kumar, Navdeep, et al.
Published: (2024)
by: Kumar, Navdeep, et al.
Published: (2024)
Optimal Mechanisms for Demand Response: An Indifference Set Approach
by: Mehrabi, Mohammad, et al.
Published: (2024)
by: Mehrabi, Mohammad, et al.
Published: (2024)
Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes
by: Bossens, David M., et al.
Published: (2025)
by: Bossens, David M., et al.
Published: (2025)
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes
by: Cassel, Asaf, et al.
Published: (2024)
by: Cassel, Asaf, et al.
Published: (2024)
Federated Control in Markov Decision Processes
by: Jin, Hao, et al.
Published: (2024)
by: Jin, Hao, et al.
Published: (2024)
Generalized Linear Markov Decision Process
by: Zhang, Sinian, et al.
Published: (2025)
by: Zhang, Sinian, et al.
Published: (2025)
Multi-Timescale Ensemble Q-learning for Markov Decision Process Policy Optimization
by: Bozkus, Talha, et al.
Published: (2024)
by: Bozkus, Talha, et al.
Published: (2024)
Sample Complexity of Offline Distributionally Robust Linear Markov Decision Processes
by: Wang, He, et al.
Published: (2024)
by: Wang, He, et al.
Published: (2024)
Admissibility of Completely Randomized Trials: A Large-Deviation Approach
by: Imbens, Guido, et al.
Published: (2025)
by: Imbens, Guido, et al.
Published: (2025)
Learning from a Biased Sample
by: Sahoo, Roshni, et al.
Published: (2022)
by: Sahoo, Roshni, et al.
Published: (2022)
Off-Policy Evaluation and Learning for the Future under Non-Stationarity
by: Shimizu, Tatsuhiro, et al.
Published: (2025)
by: Shimizu, Tatsuhiro, et al.
Published: (2025)
Learning in Markov Decision Processes with Exogenous Dynamics
by: Maran, Davide, et al.
Published: (2026)
by: Maran, Davide, et al.
Published: (2026)
Off-Policy Evaluation and Learning for Survival Outcomes under Censoring
by: Kubota, Kohsuke, et al.
Published: (2026)
by: Kubota, Kohsuke, et al.
Published: (2026)
Policy Gradient Algorithms with Monte Carlo Tree Learning for Non-Markov Decision Processes
by: Morimura, Tetsuro, et al.
Published: (2022)
by: Morimura, Tetsuro, et al.
Published: (2022)
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space
by: Adler, Saghar, et al.
Published: (2023)
by: Adler, Saghar, et al.
Published: (2023)
Similar Items
-
Conformal Off-Policy Evaluation in Markov Decision Processes
by: Foffano, Daniele, et al.
Published: (2023) -
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes
by: Bennett, Andrew, et al.
Published: (2024) -
Policy Learning with Competing Agents
by: Sahoo, Roshni, et al.
Published: (2022) -
Policy Testing in Markov Decision Processes
by: Ariu, Kaito, et al.
Published: (2025) -
Distributional Off-Policy Evaluation with Deep Quantile Process Regression
by: Kuang, Qi, et al.
Published: (2026)