| Main Authors: | Zhong, Jingfeng; Liu, Zhengxiang; Wang, Zhijie; Li, Shuai |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.21160 |
Similar Items
QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning
by: Pan, Jingfeng, et al.
Published: (2025)
FedKL: Tackling Data Heterogeneity in Federated Reinforcement Learning by Penalizing KL Divergence
by: Xie, Zhijie, et al.
Published: (2022)
Soft Forward-Backward Representations for Zero-shot Reinforcement Learning with General Utilities
by: Bagatella, Marco, et al.
Published: (2026)
ROAD: Adaptive Data Mixing for Offline-to-Online Reinforcement Learning via Bi-Level Optimization
by: Yang, Letian, et al.
Published: (2026)
The Actor-Critic Update Order Matters for PPO in Federated Reinforcement Learning
by: Xie, Zhijie, et al.
Published: (2025)
LightningRL: Breaking the Accuracy-Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning
by: Hu, Yanzhe, et al.
Published: (2026)
Offline-to-Online Reinforcement Learning with Classifier-Free Diffusion Generation
by: Huang, Xiao, et al.
Published: (2025)
Preference-Guided Reinforcement Learning for Efficient Exploration
by: Wang, Guojian, et al.
Published: (2024)
Sample Efficient Reinforcement Learning by Automatically Learning to Compose Subtasks
by: Han, Shuai, et al.
Published: (2024)
Fixed Design Analysis of Regularization-Based Continual Learning
by: Li, Haoran, et al.
Published: (2023)
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning
by: Corrado, Nicholas E., et al.
Published: (2023)
Labeled TrustSet Guided: Batch Active Learning with Reinforcement Learning
by: Cui, Guofeng, et al.
Published: (2026)
Neural Backward Filtering Forward Guiding
by: Yang, Gefan, et al.
Published: (2026)
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
by: Liu, Xutong, et al.
Published: (2024)
RL-Selector: Reinforcement Learning-Guided Data Selection via Redundancy Assessment
by: Yang, Suorong, et al.
Published: (2025)
Privacy-Preserving Heterogeneous Federated Learning for Sensitive Healthcare Data
by: Xu, Yukai, et al.
Published: (2024)
Goal-Guided Efficient Exploration via Large Language Model in Reinforcement Learning
by: Qi, Yajie, et al.
Published: (2025)
SpecRLBench: A Benchmark for Generalization in Specification-Guided Reinforcement Learning
by: Guo, Zijian, et al.
Published: (2026)
Backward Learning for Goal-Conditioned Policies
by: Höftmann, Marc, et al.
Published: (2023)
Enhancing Geometric Perception in VLMs via Translator-Guided Reinforcement Learning
by: Yu, Hao, et al.
Published: (2026)
Plasma Shape Control via Zero-shot Generative Reinforcement Learning
by: Wu, Niannian, et al.
Published: (2025)
Memory-Statistics Tradeoff in Continual Learning with Structural Regularization
by: Li, Haoran, et al.
Published: (2025)
G-PCGRL: Procedural Graph Data Generation via Reinforcement Learning
by: Rupp, Florian, et al.
Published: (2024)
Answer First, Reason Later: Aligning Search Relevance via Mode-Balanced Reinforcement Learning
by: Zhang, Shijie, et al.
Published: (2026)
Guiding Diffusion Models with Reinforcement Learning for Stable Molecule Generation
by: Zhou, Zhijian, et al.
Published: (2025)
Guiding Generative Models to Uncover Diverse and Novel Crystals via Reinforcement Learning
by: Park, Hyunsoo, et al.
Published: (2025)
Working Backwards: Learning to Place by Picking
by: Limoyo, Oliver, et al.
Published: (2023)
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning
by: Suttle, Wesley A., et al.
Published: (2025)
Robust Learning of Diffusion Models with Extremely Noisy Conditions
by: Chen, Xin, et al.
Published: (2025)
Classifier Clustering and Feature Alignment for Federated Learning under Distributed Concept Drift
by: Chen, Junbao, et al.
Published: (2024)
Hierarchical Reinforcement Learning with Uncertainty-Guided Diffusional Subgoals
by: Wang, Vivienne Huiling, et al.
Published: (2025)
Tackling Data Heterogeneity in Federated Learning via Loss Decomposition
by: Zeng, Shuang, et al.
Published: (2024)
Towards Generalizable Reinforcement Learning via Causality-Guided Self-Adaptive Representations
by: Yang, Yupei, et al.
Published: (2024)
Recursive Backwards Q-Learning in Deterministic Environments
by: Diekhoff, Jan, et al.
Published: (2024)
Guided Cooperation in Hierarchical Reinforcement Learning via Model-based Rollout
by: Wang, Haoran, et al.
Published: (2023)
Reinforcement Learning on Pre-Training Data
by: Li, Siheng, et al.
Published: (2025)
Learning Along the Arrow of Time: Hyperbolic Geometry for Backward-Compatible Representation Learning
by: Bui, Ngoc, et al.
Published: (2025)
Neuro-symbolic Action Masking for Deep Reinforcement Learning
by: Han, Shuai, et al.
Published: (2026)
LLaPipe: LLM-Guided Reinforcement Learning for Automated Data Preparation Pipeline Construction
by: Chang, Jing, et al.
Published: (2025)
First-order Sobolev Reinforcement Learning
by: Schramm, Fabian, et al.
Published: (2025)