:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shi, C., Zhang, S., Lu, W., Song, R.
Format:	Preprint
Published:	2020
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2001.04515
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Infinite-Horizon Reinforcement Learning with Multinomial Logistic Function Approximation
by: Park, Jaehyun, et al.
Published: (2024)

Statistical Inference in Reinforcement Learning: A Selective Survey
by: Shi, Chengchun
Published: (2025)

Provably Efficient Infinite-Horizon Average-Reward Reinforcement Learning with Linear Function Approximation
by: Chae, Woojin, et al.
Published: (2024)

Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning
by: Moulin, Antoine, et al.
Published: (2025)

Policy Zooming: Adaptive Discretization-based Infinite-Horizon Average-Reward Reinforcement Learning
by: Kar, Avik, et al.
Published: (2024)

Reinforcement Learning for Infinite-Horizon Average-Reward Linear MDPs via Approximation by Discounted-Reward MDPs
by: Hong, Kihyuk, et al.
Published: (2024)

Deep Reinforcement Learning for Infinite Horizon Mean Field Problems in Continuous Spaces
by: Angiuli, Andrea, et al.
Published: (2023)

Nonparametric Additive Value Functions: Interpretable Reinforcement Learning with an Application to Surgical Recovery
by: Emedom-Nnamdi, Patrick, et al.
Published: (2023)

Infinite-Horizon Reach-Avoid Zero-Sum Games via Deep Reinforcement Learning
by: Li, Jingqi, et al.
Published: (2022)

Learning Infinite-Horizon Average-Reward Linear Mixture MDPs of Bounded Span
by: Chae, Woojin, et al.
Published: (2024)

Statistical Inference for Temporal Difference Learning with Linear Function Approximation
by: Wu, Weichen, et al.
Published: (2024)

Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation
by: Gan, Feichen, et al.
Published: (2025)

Horizon Generalization in Reinforcement Learning
by: Myers, Vivek, et al.
Published: (2025)

Exchange Policy Optimization Algorithm for Semi-Infinite Safe Reinforcement Learning
by: Zhang, Jiaming, et al.
Published: (2025)

Reinforcement Learning from Human Feedback: A Statistical Perspective
by: Liu, Pangpang, et al.
Published: (2026)

Active Value Querying to Minimize Additive Error in Subadditive Set Function Learning
by: Černý, Martin, et al.
Published: (2026)

Statistical Inference for Fuzzy Clustering
by: Wu, Qiuyi, et al.
Published: (2026)

Learning Set Functions with Implicit Differentiation
by: Özcan, Gözde, et al.
Published: (2024)

UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning
by: Zhang, Yu, et al.
Published: (2024)

On the Importance of Multistability for Horizon Generalization in Reinforcement Learning
by: Bakija, Asad, et al.
Published: (2026)

Inverse Reinforcement Learning with Multiple Planning Horizons
by: Yao, Jiayu, et al.
Published: (2024)

Stochastic Decision Horizons for Constrained Reinforcement Learning
by: Milosevic, Nikola, et al.
Published: (2026)

Issues with Value-Based Multi-objective Reinforcement Learning: Value Function Interference and Overestimation Sensitivity
by: Vamplew, Peter, et al.
Published: (2024)

Statistical Learning from Attribution Sets
by: Applebaum, Lorne, et al.
Published: (2026)

Factored Value Functions for Graph-Based Multi-Agent Reinforcement Learning
by: Rashwan, Ahmed, et al.
Published: (2026)

Fixing Incomplete Value Function Decomposition for Multi-Agent Reinforcement Learning
by: Baisero, Andrea, et al.
Published: (2025)

On the Effective Horizon of Inverse Reinforcement Learning
by: Xu, Yiqing, et al.
Published: (2023)

Online Learning with Set-Valued Feedback
by: Raman, Vinod, et al.
Published: (2023)

Estimation and Inference in Distributional Reinforcement Learning
by: Zhang, Liangyu, et al.
Published: (2023)

What Makes Value Learning Efficient in Residual Reinforcement Learning?
by: Ma, Guozheng, et al.
Published: (2026)

A Physics-Informed Learning Framework to Solve the Infinite-Horizon Optimal Control Problem
by: Fotiadis, Filippos, et al.
Published: (2025)

Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe
by: Wu, Xixi, et al.
Published: (2026)

VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025)

Set-Valued Policy Learning
by: Fuentes-Vicente, Laura, et al.
Published: (2026)

Learning U-Statistics with Active Inference
by: Wang, Xiaoning, et al.
Published: (2026)

Horizon Reduction as Information Loss in Offline Reinforcement Learning
by: Nidadala, Uday Kumar, et al.
Published: (2025)

Conformal Calibration of Statistical Confidence Sets
by: Cabezas, Luben M. C., et al.
Published: (2024)

Reinforcement Learning with Random Time Horizons
by: Borrell, Enric Ribera, et al.
Published: (2025)

Investigating Lagrangian Neural Networks for Infinite Horizon Planning in Quadrupedal Locomotion
by: Kotecha, Prakrut, et al.
Published: (2025)

MARPLE: A Benchmark for Long-Horizon Inference
by: Jin, Emily, et al.
Published: (2024)