:: Library Catalog

Bibliographic Details
Main Authors:	Pavse, Brahma S., Chen, Yudong, Xie, Qiaomin, Hanna, Josiah P.
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.01643

Similar Items

Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
by: Pavse, Brahma S., et al.
Published: (2023)

Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)

Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
by: Mukherjee, Subhojyoti, et al.
Published: (2024)

VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning
by: Chen, Xuyang, et al.
Published: (2025)

SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits
by: Mukherjee, Subhojyoti, et al.
Published: (2023)

Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning?
by: Tarasov, Denis, et al.
Published: (2024)

BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
by: Lin, Haohong, et al.
Published: (2024)

Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
by: Yu, Xudong, et al.
Published: (2024)

Reinforcement Learning via Auxiliary Task Distillation
by: Harish, Abhinav Narayan, et al.
Published: (2024)

An Empirical Study on the Power of Future Prediction in Partially Observable Environments
by: Kwon, Jeongyeol, et al.
Published: (2024)

Demystifying the Paradox of Importance Sampling with an Estimated History-Dependent Behavior Policy in Off-Policy Evaluation
by: Zhou, Hongyi, et al.
Published: (2025)

Is Value Learning Really the Main Bottleneck in Offline RL?
by: Park, Seohong, et al.
Published: (2024)

Residual Q-Learning: Offline and Online Policy Customization without Value
by: Li, Chenran, et al.
Published: (2023)

LLM-Augmented Computational Phenotyping of Long Covid
by: Wang, Jing, et al.
Published: (2026)

AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset
by: Lee, Dongsu, et al.
Published: (2024)

Decoupled Guidance Diffusion for Adaptive Offline Safe Reinforcement Learning
by: Chen, Rufeng, et al.
Published: (2026)

Behavior-Invariant Task Representation Learning with Transformer-based World Models for Offline Meta-Reinforcement Learning
by: Qian, Fuyuan, et al.
Published: (2026)

Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2024)

Sparsely Multimodal Data Fusion
by: Bjorgaard, Josiah
Published: (2024)

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
by: Zhang, Xinyu, et al.
Published: (2024)

Diffusion Policies with Value-Conditional Optimization for Offline Reinforcement Learning
by: Ma, Yunchang, et al.
Published: (2025)

Bias and Extrapolation in Markovian Linear Stochastic Approximation with Constant Stepsizes
by: Huo, Dongyan, et al.
Published: (2022)

Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)

Disentangling Policy from Offline Task Representation Learning via Adversarial Data Augmentation
by: Jia, Chengxing, et al.
Published: (2024)

Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning
by: Tiofack, Franki Nguimatsia, et al.
Published: (2025)

OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning
by: Ahn, Woo-Jin, et al.
Published: (2025)

Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
by: Bai, Chenjia, et al.
Published: (2024)

Option-aware Temporally Abstracted Value for Offline Goal-Conditioned Reinforcement Learning
by: Ahn, Hongjoon, et al.
Published: (2025)

Expressive Value Learning for Scalable Offline Reinforcement Learning
by: Espinosa-Dice, Nicolas, et al.
Published: (2025)

Offline Reinforcement Learning in Large State Spaces: Algorithms and Guarantees
by: Jiang, Nan, et al.
Published: (2025)

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
by: Wen, Xiaoyu, et al.
Published: (2024)

Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning
by: Wang, Changhong, et al.
Published: (2024)

Using Curiosity for an Even Representation of Tasks in Continual Offline Reinforcement Learning
by: Pathmanathan, Pankayaraj, et al.
Published: (2023)

OPRIDE: Offline Preference-based Reinforcement Learning via In-Dataset Exploration
by: Yang, Yiqin, et al.
Published: (2026)

A Recipe for Stable Offline Multi-agent Reinforcement Learning
by: Lee, Dongsu, et al.
Published: (2026)

Stable CDE Autoencoders with Acuity Regularization for Offline Reinforcement Learning in Sepsis Treatment
by: Gao, Yue
Published: (2025)

Permutation Equivariant Model-based Offline Reinforcement Learning for Auto-bidding
by: Mou, Zhiyu, et al.
Published: (2025)

In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning
by: Tu, Songjun, et al.
Published: (2024)

Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning
by: Chen, Jiayu, et al.
Published: (2024)

Offline Imitation Learning with Model-based Reverse Augmentation
by: Shao, Jie-Jing, et al.
Published: (2024)