Saved in:
| Main Authors: | Romeo, Carlo, Bagdanov, Andrew D. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.10839 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders
by: Romeo, Carlo, et al.
Published: (2026)
by: Romeo, Carlo, et al.
Published: (2026)
SPEQ: Offline Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning
by: Romeo, Carlo, et al.
Published: (2025)
by: Romeo, Carlo, et al.
Published: (2025)
A Benchmark Environment for Offline Reinforcement Learning in Racing Games
by: Macaluso, Girolamo, et al.
Published: (2024)
by: Macaluso, Girolamo, et al.
Published: (2024)
SOPE: Stabilizing Off-Policy Evaluation for Online RL with Prior Data
by: Romeo, Carlo, et al.
Published: (2026)
by: Romeo, Carlo, et al.
Published: (2026)
TROFI: Trajectory-Ranked Offline Inverse Reinforcement Learning
by: Sestini, Alessandro, et al.
Published: (2025)
by: Sestini, Alessandro, et al.
Published: (2025)
NTRL: Encounter Generation via Reinforcement Learning for Dynamic Difficulty Adjustment in Dungeons and Dragons
by: Romeo, Carlo, et al.
Published: (2025)
by: Romeo, Carlo, et al.
Published: (2025)
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning
by: Choi, Heewoong, et al.
Published: (2024)
by: Choi, Heewoong, et al.
Published: (2024)
Reward Generation via Large Vision-Language Model in Offline Reinforcement Learning
by: Lee, Younghwan, et al.
Published: (2025)
by: Lee, Younghwan, et al.
Published: (2025)
Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data
by: Kim, Jeonghye, et al.
Published: (2025)
by: Kim, Jeonghye, et al.
Published: (2025)
Robust Offline Reinforcement learning with Heavy-Tailed Rewards
by: Zhu, Jin, et al.
Published: (2023)
by: Zhu, Jin, et al.
Published: (2023)
Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)
by: Zhao, Ziqi, et al.
Published: (2024)
Generative Adversarial Networks for Imputing Sparse Learning Performance
by: Zhang, Liang, et al.
Published: (2024)
by: Zhang, Liang, et al.
Published: (2024)
Universal Black-Box Reward Poisoning Attack against Offline Reinforcement Learning
by: Xu, Yinglun, et al.
Published: (2024)
by: Xu, Yinglun, et al.
Published: (2024)
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
by: Liu, Vincent, et al.
Published: (2023)
by: Liu, Vincent, et al.
Published: (2023)
Exploring and Addressing Reward Confusion in Offline Preference Learning
by: Chen, Xin, et al.
Published: (2024)
by: Chen, Xin, et al.
Published: (2024)
Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation
by: Mistretta, Marco, et al.
Published: (2024)
by: Mistretta, Marco, et al.
Published: (2024)
SpectralGCD: Spectral Concept Selection and Cross-modal Representation Learning for Generalized Category Discovery
by: Caselli, Lorenzo, et al.
Published: (2026)
by: Caselli, Lorenzo, et al.
Published: (2026)
Preference Elicitation for Offline Reinforcement Learning
by: Pace, Alizée, et al.
Published: (2024)
by: Pace, Alizée, et al.
Published: (2024)
Simple Ingredients for Offline Reinforcement Learning
by: Cetin, Edoardo, et al.
Published: (2024)
by: Cetin, Edoardo, et al.
Published: (2024)
State-Constrained Offline Reinforcement Learning
by: Hepburn, Charles A., et al.
Published: (2024)
by: Hepburn, Charles A., et al.
Published: (2024)
Dataset Distillation for Offline Reinforcement Learning
by: Light, Jonathan, et al.
Published: (2024)
by: Light, Jonathan, et al.
Published: (2024)
Offline Reinforcement Learning with Imbalanced Datasets
by: Jiang, Li, et al.
Published: (2023)
by: Jiang, Li, et al.
Published: (2023)
The Generalization Gap in Offline Reinforcement Learning
by: Mediratta, Ishita, et al.
Published: (2023)
by: Mediratta, Ishita, et al.
Published: (2023)
OffSim: Offline Simulator for Model-based Offline Inverse Reinforcement Learning
by: Ahn, Woo-Jin, et al.
Published: (2025)
by: Ahn, Woo-Jin, et al.
Published: (2025)
Multi-level Certified Defense Against Poisoning Attacks in Offline Reinforcement Learning
by: Liu, Shijie, et al.
Published: (2025)
by: Liu, Shijie, et al.
Published: (2025)
M$^3$-Impute: Mask-guided Representation Learning for Missing Value Imputation
by: Yu, Zhongyi, et al.
Published: (2024)
by: Yu, Zhongyi, et al.
Published: (2024)
Percentile Criterion Optimization in Offline Reinforcement Learning
by: Lobo, Elita A., et al.
Published: (2024)
by: Lobo, Elita A., et al.
Published: (2024)
Doubly Mild Generalization for Offline Reinforcement Learning
by: Mao, Yixiu, et al.
Published: (2024)
by: Mao, Yixiu, et al.
Published: (2024)
KAN v.s. MLP for Offline Reinforcement Learning
by: Guo, Haihong, et al.
Published: (2024)
by: Guo, Haihong, et al.
Published: (2024)
Offline Reinforcement Learning with Behavioral Supervisor Tuning
by: Srinivasan, Padmanaba, et al.
Published: (2024)
by: Srinivasan, Padmanaba, et al.
Published: (2024)
Mutual Information Regularized Offline Reinforcement Learning
by: Ma, Xiao, et al.
Published: (2022)
by: Ma, Xiao, et al.
Published: (2022)
Abstraction for Offline Goal-Conditioned Reinforcement Learning
by: Wibault, Clarisse, et al.
Published: (2026)
by: Wibault, Clarisse, et al.
Published: (2026)
Offline Reinforcement Learning with Generative Trajectory Policies
by: Feng, Xinsong, et al.
Published: (2025)
by: Feng, Xinsong, et al.
Published: (2025)
Behavior Preference Regression for Offline Reinforcement Learning
by: Srinivasan, Padmanaba, et al.
Published: (2025)
by: Srinivasan, Padmanaba, et al.
Published: (2025)
Online Optimization for Offline Safe Reinforcement Learning
by: Chemingui, Yassine, et al.
Published: (2025)
by: Chemingui, Yassine, et al.
Published: (2025)
Offline Reinforcement Learning with Universal Horizon Models
by: Chung, Hojun, et al.
Published: (2026)
by: Chung, Hojun, et al.
Published: (2026)
Flow Actor-Critic for Offline Reinforcement Learning
by: Chae, Jongseong, et al.
Published: (2026)
by: Chae, Jongseong, et al.
Published: (2026)
The Three Regimes of Offline-to-Online Reinforcement Learning
by: Li, Lu, et al.
Published: (2025)
by: Li, Lu, et al.
Published: (2025)
Federated Ensemble-Directed Offline Reinforcement Learning
by: Rengarajan, Desik, et al.
Published: (2023)
by: Rengarajan, Desik, et al.
Published: (2023)
In-Context Compositional Q-Learning for Offline Reinforcement Learning
by: Xu, Qiushui, et al.
Published: (2025)
by: Xu, Qiushui, et al.
Published: (2025)
Similar Items
-
ARC-RL: A Reinforcement Learning Playground Inspired by ARC Raiders
by: Romeo, Carlo, et al.
Published: (2026) -
SPEQ: Offline Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning
by: Romeo, Carlo, et al.
Published: (2025) -
A Benchmark Environment for Offline Reinforcement Learning in Racing Games
by: Macaluso, Girolamo, et al.
Published: (2024) -
SOPE: Stabilizing Off-Policy Evaluation for Online RL with Prior Data
by: Romeo, Carlo, et al.
Published: (2026) -
TROFI: Trajectory-Ranked Offline Inverse Reinforcement Learning
by: Sestini, Alessandro, et al.
Published: (2025)