| Main authors: | Liu, Wei; Long, Ting |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online access: | https://arxiv.org/abs/2605.22376 |
Similar items
Bellman Calibration for $V$-Learning in Offline Reinforcement Learning
by: van der Laan, Lars, et al.
Published: (2025)
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
by: Liu, Xiao-Yin, et al.
Published: (2023)
Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning
by: Kim, Minung, et al.
Published: (2026)
The Role of Inherent Bellman Error in Offline Reinforcement Learning with Linear Function Approximation
by: Golowich, Noah, et al.
Published: (2024)
Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering
by: Qiao, Zhongjian, et al.
Published: (2025)
Active Advantage-Aligned Online Reinforcement Learning with Offline Data
by: Liu, Xuefeng, et al.
Published: (2025)
Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL
by: Luo, Qin-Wen, et al.
Published: (2025)
Target-Aligned Reinforcement Learning
by: Pleiss, Leonard S., et al.
Published: (2026)
DmC: Nearest Neighbor Guidance Diffusion Model for Offline Cross-domain Reinforcement Learning
by: Van, Linh Le Pham, et al.
Published: (2025)
Offline Reinforcement Learning using Human-Aligned Reward Labeling for Autonomous Emergency Braking in Occluded Pedestrian Crossing
by: Asodia, Vinal, et al.
Published: (2025)
Theoretical Barriers in Bellman-Based Reinforcement Learning
by: Pinon, Brieuc, et al.
Published: (2025)
Gradual Transition from Bellman Optimality Operator to Bellman Operator in Online Reinforcement Learning
by: Omura, Motoki, et al.
Published: (2025)
DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching
by: Li, Guanghe, et al.
Published: (2024)
Sparse Offline Reinforcement Learning with Corruption Robustness
by: Tran, Nam Phuong, et al.
Published: (2025)
Offline Trajectory Optimization for Offline Reinforcement Learning
by: Zhao, Ziqi, et al.
Published: (2024)
Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach
by: Lu, Chenbei, et al.
Published: (2025)
Efficient Anti-exploration via VQVAE and Fuzzy Clustering in Offline Reinforcement Learning
by: Chen, Long, et al.
Published: (2026)
Nonparametric Bellman Mappings for Value Iteration in Distributed Reinforcement Learning
by: Akiyama, Yuki, et al.
Published: (2025)
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning
by: Wei, Honghao, et al.
Published: (2024)
Path-Coupled Bellman Flows for Distributional Reinforcement Learning
by: Xu, Boyang, et al.
Published: (2026)
Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts
by: Qiao, Zhongjian, et al.
Published: (2025)
Hybrid Cross-domain Robust Reinforcement Learning
by: Van, Linh Le Pham, et al.
Published: (2025)
Double Check My Desired Return: Transformer with Target Alignment for Offline Reinforcement Learning
by: Pei, Yue, et al.
Published: (2025)
Dataset Distillation for Offline Reinforcement Learning
by: Light, Jonathan, et al.
Published: (2024)
Cross-domain Random Pre-training with Prototypes for Reinforcement Learning
by: Liu, Xin, et al.
Published: (2023)
Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics
by: Chen, Ming-Hong, et al.
Published: (2026)
Markowitz Meets Bellman: Knowledge-distilled Reinforcement Learning for Portfolio Management
by: Hu, Gang, et al.
Published: (2024)
Nonparametric Bellman Mappings for Reinforcement Learning: Application to Robust Adaptive Filtering
by: Akiyama, Yuki, et al.
Published: (2024)
Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning
by: Wen, Xiaoyu, et al.
Published: (2024)
Equivariant Offline Reinforcement Learning
by: Tangri, Arsh, et al.
Published: (2024)
Federated Offline Reinforcement Learning
by: Zhou, Doudou, et al.
Published: (2022)
On the Complexity of Offline Reinforcement Learning with $Q^\star$-Approximation and Partial Coverage
by: Liu, Haolin, et al.
Published: (2026)
Unifying Value Alignment and Assignment in Cross-Domain Offline Reinforcement Learning with Heterogeneous Datasets
by: Qiao, Zhongjian, et al.
Published: (2026)
Epistemic Robust Offline Reinforcement Learning
by: Chenreddy, Abhilash Reddy, et al.
Published: (2026)
Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare
by: Fang, Nan, et al.
Published: (2024)
Offline Multitask Representation Learning for Reinforcement Learning
by: Ishfaq, Haque, et al.
Published: (2024)
Quotient-Categorical Representations for Bellman-Compatible Average-Reward Distributional Reinforcement Learning
by: Kaya, Ege C., et al.
Published: (2026)
KAN v.s. MLP for Offline Reinforcement Learning
by: Guo, Haihong, et al.
Published: (2024)
Linear Bellman Completeness Suffices for Efficient Online Reinforcement Learning with Few Actions
by: Golowich, Noah, et al.
Published: (2024)
Symmetric Q-learning: Reducing Skewness of Bellman Error in Online Reinforcement Learning
by: Omura, Motoki, et al.
Published: (2024)