Saved in:
| Main Authors: | Srinivasan, Padmanaba, Knottenbelt, William |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.00930 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Offline Reinforcement Learning with Behavioral Supervisor Tuning
by: Srinivasan, Padmanaba, et al.
Published: (2024)
by: Srinivasan, Padmanaba, et al.
Published: (2024)
Offline Model-Based Reinforcement Learning with Anti-Exploration
by: Srinivasan, Padmanaba, et al.
Published: (2024)
by: Srinivasan, Padmanaba, et al.
Published: (2024)
Preference Elicitation for Offline Reinforcement Learning
by: Pace, Alizée, et al.
Published: (2024)
by: Pace, Alizée, et al.
Published: (2024)
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2024)
by: Gao, Chen-Xiao, et al.
Published: (2024)
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
by: Kang, Hyungkyu, et al.
Published: (2025)
by: Kang, Hyungkyu, et al.
Published: (2025)
Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning
by: Zhang, Tianle, et al.
Published: (2024)
by: Zhang, Tianle, et al.
Published: (2024)
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning
by: Choi, Heewoong, et al.
Published: (2024)
by: Choi, Heewoong, et al.
Published: (2024)
Should We Ever Prefer Decision Transformer for Offline Reinforcement Learning?
by: Omori, Yumi, et al.
Published: (2025)
by: Omori, Yumi, et al.
Published: (2025)
Search-Based Credit Assignment for Offline Preference-Based Reinforcement Learning
by: Gao, Xiancheng, et al.
Published: (2025)
by: Gao, Xiancheng, et al.
Published: (2025)
LEASE: Offline Preference-based Reinforcement Learning with High Sample Efficiency
by: Liu, Xiao-Yin, et al.
Published: (2024)
by: Liu, Xiao-Yin, et al.
Published: (2024)
OPRIDE: Offline Preference-based Reinforcement Learning via In-Dataset Exploration
by: Yang, Yiqin, et al.
Published: (2026)
by: Yang, Yiqin, et al.
Published: (2026)
Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions
by: Xu, Yinglun, et al.
Published: (2023)
by: Xu, Yinglun, et al.
Published: (2023)
In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning
by: Tu, Songjun, et al.
Published: (2024)
by: Tu, Songjun, et al.
Published: (2024)
Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2025)
by: Gao, Chen-Xiao, et al.
Published: (2025)
SelfBC: Self Behavior Cloning for Offline Reinforcement Learning
by: Liu, Shirong, et al.
Published: (2024)
by: Liu, Shirong, et al.
Published: (2024)
Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)
by: Zhao, Ziqi, et al.
Published: (2024)
Interactive Symbolic Regression through Offline Reinforcement Learning: A Co-Design Framework
by: Tian, Yuan, et al.
Published: (2024)
by: Tian, Yuan, et al.
Published: (2024)
RAMAC: Multimodal Risk-Aware Offline Reinforcement Learning and the Role of Behavior Regularization
by: Fukazawa, Kai, et al.
Published: (2025)
by: Fukazawa, Kai, et al.
Published: (2025)
Proximal Action Replacement for Behavior Cloning Actor-Critic in Offline Reinforcement Learning
by: Dong, Jinzong, et al.
Published: (2026)
by: Dong, Jinzong, et al.
Published: (2026)
MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning
by: Yuan, Yifu, et al.
Published: (2024)
by: Yuan, Yifu, et al.
Published: (2024)
Exploring and Addressing Reward Confusion in Offline Preference Learning
by: Chen, Xin, et al.
Published: (2024)
by: Chen, Xin, et al.
Published: (2024)
Offline Reinforcement Learning of High-Quality Behaviors Under Robust Style Alignment
by: Petitbois, Mathieu, et al.
Published: (2026)
by: Petitbois, Mathieu, et al.
Published: (2026)
Fine-tuning Behavioral Cloning Policies with Preference-Based Reinforcement Learning
by: Macuglia, Maël, et al.
Published: (2025)
by: Macuglia, Maël, et al.
Published: (2025)
Behavior-Invariant Task Representation Learning with Transformer-based World Models for Offline Meta-Reinforcement Learning
by: Qian, Fuyuan, et al.
Published: (2026)
by: Qian, Fuyuan, et al.
Published: (2026)
zkFL: Zero-Knowledge Proof-based Gradient Aggregation for Federated Learning
by: Wang, Zhipeng, et al.
Published: (2023)
by: Wang, Zhipeng, et al.
Published: (2023)
Offline Reinforcement Learning with Imbalanced Datasets
by: Jiang, Li, et al.
Published: (2023)
by: Jiang, Li, et al.
Published: (2023)
Simple Ingredients for Offline Reinforcement Learning
by: Cetin, Edoardo, et al.
Published: (2024)
by: Cetin, Edoardo, et al.
Published: (2024)
State-Constrained Offline Reinforcement Learning
by: Hepburn, Charles A., et al.
Published: (2024)
by: Hepburn, Charles A., et al.
Published: (2024)
The Generalization Gap in Offline Reinforcement Learning
by: Mediratta, Ishita, et al.
Published: (2023)
by: Mediratta, Ishita, et al.
Published: (2023)
Dataset Distillation for Offline Reinforcement Learning
by: Light, Jonathan, et al.
Published: (2024)
by: Light, Jonathan, et al.
Published: (2024)
Offline Reinforcement Learning with Imputed Rewards
by: Romeo, Carlo, et al.
Published: (2024)
by: Romeo, Carlo, et al.
Published: (2024)
Interactive Symbolic Regression through Offline Reinforcement Learning: A Co-Design Framework
by: Tian, Yuan, et al.
Published: (2025)
by: Tian, Yuan, et al.
Published: (2025)
Offline Learning of Controllable Diverse Behaviors
by: Petitbois, Mathieu, et al.
Published: (2025)
by: Petitbois, Mathieu, et al.
Published: (2025)
Offline Behavior Distillation
by: Lei, Shiye, et al.
Published: (2024)
by: Lei, Shiye, et al.
Published: (2024)
OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning
by: Ahn, Woo-Jin, et al.
Published: (2025)
by: Ahn, Woo-Jin, et al.
Published: (2025)
Offline Reinforcement Learning with Generative Trajectory Policies
by: Feng, Xinsong, et al.
Published: (2025)
by: Feng, Xinsong, et al.
Published: (2025)
Online Optimization for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2025)
by: Chemingui, Yassine, et al.
Published: (2025)
The Three Regimes of Offline-to-Online Reinforcement Learning
by: Li, Lu, et al.
Published: (2025)
by: Li, Lu, et al.
Published: (2025)
In-Context Compositional Q-Learning for Offline Reinforcement Learning
by: Xu, Qiushui, et al.
Published: (2025)
by: Xu, Qiushui, et al.
Published: (2025)
Imagination-Limited Q-Learning for Offline Reinforcement Learning
by: Liu, Wenhui, et al.
Published: (2025)
by: Liu, Wenhui, et al.
Published: (2025)
Similar Items
-
Offline Reinforcement Learning with Behavioral Supervisor Tuning
by: Srinivasan, Padmanaba, et al.
Published: (2024) -
Offline Model-Based Reinforcement Learning with Anti-Exploration
by: Srinivasan, Padmanaba, et al.
Published: (2024) -
Preference Elicitation for Offline Reinforcement Learning
by: Pace, Alizée, et al.
Published: (2024) -
Hindsight Preference Learning for Offline Preference-based Reinforcement Learning
by: Gao, Chen-Xiao, et al.
Published: (2024) -
Adversarial Policy Optimization for Offline Preference-based Reinforcement Learning
by: Kang, Hyungkyu, et al.
Published: (2025)