Similar Items
Infinite-Horizon Reinforcement Learning with Multinomial Logistic Function Approximation
by: Park, Jaehyun, et al.
Published: (2024)
by: Park, Jaehyun, et al.
Published: (2024)
Statistical Inference in Reinforcement Learning: A Selective Survey
by: Shi, Chengchun
Published: (2025)
by: Shi, Chengchun
Published: (2025)
Provably Efficient Infinite-Horizon Average-Reward Reinforcement Learning with Linear Function Approximation
by: Chae, Woojin, et al.
Published: (2024)
by: Chae, Woojin, et al.
Published: (2024)
Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning
by: Moulin, Antoine, et al.
Published: (2025)
by: Moulin, Antoine, et al.
Published: (2025)
Policy Zooming: Adaptive Discretization-based Infinite-Horizon Average-Reward Reinforcement Learning
by: Kar, Avik, et al.
Published: (2024)
by: Kar, Avik, et al.
Published: (2024)
Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs
by: Hong, Kihyuk, et al.
Published: (2024)
by: Hong, Kihyuk, et al.
Published: (2024)
Deep Reinforcement Learning for Infinite Horizon Mean Field Problems in Continuous Spaces
by: Angiuli, Andrea, et al.
Published: (2023)
by: Angiuli, Andrea, et al.
Published: (2023)
Nonparametric Additive Value Functions: Interpretable Reinforcement Learning with an Application to Surgical Recovery
by: Emedom-Nnamdi, Patrick, et al.
Published: (2023)
by: Emedom-Nnamdi, Patrick, et al.
Published: (2023)
Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning
by: Li, Jingqi, et al.
Published: (2022)
by: Li, Jingqi, et al.
Published: (2022)
Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded Span
by: Chae, Woojin, et al.
Published: (2024)
by: Chae, Woojin, et al.
Published: (2024)
Statistical Inference for Temporal Difference Learning with Linear Function Approximation
by: Wu, Weichen, et al.
Published: (2024)
by: Wu, Weichen, et al.
Published: (2024)
Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation
by: Gan, Feichen, et al.
Published: (2025)
by: Gan, Feichen, et al.
Published: (2025)
Horizon Generalization in Reinforcement Learning
by: Myers, Vivek, et al.
Published: (2025)
by: Myers, Vivek, et al.
Published: (2025)
Exchange Policy Optimization Algorithm for Semi-Infinite Safe Reinforcement Learning
by: Zhang, Jiaming, et al.
Published: (2025)
by: Zhang, Jiaming, et al.
Published: (2025)
Reinforcement Learning from Human Feedback: A Statistical Perspective
by: Liu, Pangpang, et al.
Published: (2026)
by: Liu, Pangpang, et al.
Published: (2026)
Active Value Querying to Minimize Additive Error in Subadditive Set Function Learning
by: Černý, Martin, et al.
Published: (2026)
by: Černý, Martin, et al.
Published: (2026)
Statistical Inference for Fuzzy Clustering
by: Wu, Qiuyi, et al.
Published: (2026)
by: Wu, Qiuyi, et al.
Published: (2026)
Learning Set Functions with Implicit Differentiation
by: Özcan, Gözde, et al.
Published: (2024)
by: Özcan, Gözde, et al.
Published: (2024)
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
by: Zhang, Yu, et al.
Published: (2024)
by: Zhang, Yu, et al.
Published: (2024)
On the Importance of Multistability for Horizon Generalization in Reinforcement Learning
by: Bakija, Asad, et al.
Published: (2026)
by: Bakija, Asad, et al.
Published: (2026)
Inverse Reinforcement Learning with Multiple Planning Horizons
by: Yao, Jiayu, et al.
Published: (2024)
by: Yao, Jiayu, et al.
Published: (2024)
Stochastic Decision Horizons for Constrained Reinforcement Learning
by: Milosevic, Nikola, et al.
Published: (2026)
by: Milosevic, Nikola, et al.
Published: (2026)
Issues with Value-Based Multi-objective Reinforcement Learning: Value Function Interference and Overestimation Sensitivity
by: Vamplew, Peter, et al.
Published: (2024)
by: Vamplew, Peter, et al.
Published: (2024)
Statistical Learning from Attribution Sets
by: Applebaum, Lorne, et al.
Published: (2026)
by: Applebaum, Lorne, et al.
Published: (2026)
Factored Value Functions for Graph-Based Multi-Agent Reinforcement Learning
by: Rashwan, Ahmed, et al.
Published: (2026)
by: Rashwan, Ahmed, et al.
Published: (2026)
Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning
by: Baisero, Andrea, et al.
Published: (2025)
by: Baisero, Andrea, et al.
Published: (2025)
On the Effective Horizon of Inverse Reinforcement Learning
by: Xu, Yiqing, et al.
Published: (2023)
by: Xu, Yiqing, et al.
Published: (2023)
Online Learning with Set-Valued Feedback
by: Raman, Vinod, et al.
Published: (2023)
by: Raman, Vinod, et al.
Published: (2023)
Estimation and Inference in Distributional Reinforcement Learning
by: Zhang, Liangyu, et al.
Published: (2023)
by: Zhang, Liangyu, et al.
Published: (2023)
What Makes Value Learning Efficient in Residual Reinforcement Learning?
by: Ma, Guozheng, et al.
Published: (2026)
by: Ma, Guozheng, et al.
Published: (2026)
A Physics-Informed Learning Framework to Solve the Infinite-Horizon Optimal Control Problem
by: Fotiadis, Filippos, et al.
Published: (2025)
by: Fotiadis, Filippos, et al.
Published: (2025)
Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe
by: Wu, Xixi, et al.
Published: (2026)
by: Wu, Xixi, et al.
Published: (2026)
VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025)
by: Chen, Xuyang, et al.
Published: (2025)
Set-Valued Policy Learning
by: Fuentes-Vicente, Laura, et al.
Published: (2026)
by: Fuentes-Vicente, Laura, et al.
Published: (2026)
Learning U-Statistics with Active Inference
by: Wang, Xiaoning, et al.
Published: (2026)
by: Wang, Xiaoning, et al.
Published: (2026)
Horizon Reduction as Information Loss in Offline Reinforcement Learning
by: Nidadala, Uday Kumar, et al.
Published: (2025)
by: Nidadala, Uday Kumar, et al.
Published: (2025)
Conformal Calibration of Statistical Confidence Sets
by: Cabezas, Luben M. C., et al.
Published: (2024)
by: Cabezas, Luben M. C., et al.
Published: (2024)
Reinforcement Learning with Random Time Horizons
by: Borrell, Enric Ribera, et al.
Published: (2025)
by: Borrell, Enric Ribera, et al.
Published: (2025)
Investigating Lagrangian Neural Networks for Infinite Horizon Planning in Quadrupedal Locomotion
by: Kotecha, Prakrut, et al.
Published: (2025)
by: Kotecha, Prakrut, et al.
Published: (2025)
MARPLE: A Benchmark for Long-Horizon Inference
by: Jin, Emily, et al.
Published: (2024)
by: Jin, Emily, et al.
Published: (2024)
Similar Items
-
Infinite-Horizon Reinforcement Learning with Multinomial Logistic Function Approximation
by: Park, Jaehyun, et al.
Published: (2024) -
Statistical Inference in Reinforcement Learning: A Selective Survey
by: Shi, Chengchun
Published: (2025) -
Provably Efficient Infinite-Horizon Average-Reward Reinforcement Learning with Linear Function Approximation
by: Chae, Woojin, et al.
Published: (2024) -
Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning
by: Moulin, Antoine, et al.
Published: (2025) -
Policy Zooming: Adaptive Discretization-based Infinite-Horizon Average-Reward Reinforcement Learning
by: Kar, Avik, et al.
Published: (2024)