| Main Authors: | He, Zhouyu, Qiao, Peng, Li, Rongchun, Dou, Yong, Tan, Yusong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.20190 |
Similar Items
Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning
by: Li, Yanjie, et al.
Published: (2024)
Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
by: Liu, Zhihong, et al.
Published: (2024)
DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
by: Li, Zongyue, et al.
Published: (2025)
Transolver is a Linear Transformer: Revisiting Physics-Attention through the Lens of Linear Attention
by: Hu, Wenjie, et al.
Published: (2025)
Search-Based Credit Assignment for Offline Preference-Based Reinforcement Learning
by: Gao, Xiancheng, et al.
Published: (2025)
Pinpointing crucial steps: Attribution-based Credit Assignment for Verifiable Reinforcement Learning
by: Yin, Junxi, et al.
Published: (2025)
HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents
by: Peng, Jiangweizhi, et al.
Published: (2026)
Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
by: Ramesh, Aditya A., et al.
Published: (2024)
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning
by: Pignatelli, Eduardo, et al.
Published: (2023)
Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment Problem
by: Tan, Zhentao, et al.
Published: (2024)
SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning
by: Wang, Jichao, et al.
Published: (2026)
Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
by: Qu, Yun, et al.
Published: (2024)
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning
by: R, Shreyas S
Published: (2024)
NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment
by: Koduri, Harsha
Published: (2025)
LLM-Guided Reinforcement Learning: Addressing Training Bottlenecks through Policy Modulation
by: Tan, Heng, et al.
Published: (2025)
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
by: Luo, Xufang, et al.
Published: (2025)
Reinforcement Learning for Scalable Train Timetable Rescheduling with Graph Representation
by: Yue, Peng, et al.
Published: (2024)
Evaluating Feature Dependent Noise in Preference-based Reinforcement Learning
by: Li, Yuxuan, et al.
Published: (2026)
Unsupervised Learning for Quadratic Assignment
by: Min, Yimeng, et al.
Published: (2025)
Rumor Detection on Social Media with Reinforcement Learning-based Key Propagation Graph Generator
by: Zhang, Yusong, et al.
Published: (2024)
Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning
by: Koyamada, Sotetsu, et al.
Published: (2023)
Scaling Behaviors of LLM Reinforcement Learning Post-Training: An Empirical Study in Mathematical Reasoning
by: Tan, Zelin, et al.
Published: (2025)
Controllable Flow Matching for Online Reinforcement Learning
by: Wang, Bin, et al.
Published: (2025)
ParaDySe: A Parallel-Strategy Switching Framework for Dynamic Sequence Lengths in Transformer
by: Ou, Zhixin, et al.
Published: (2025)
Energy Consumption in Parallel Neural Network Training
by: Huber, Philipp, et al.
Published: (2025)
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2024)
RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation
by: Yin, Xiaolong, et al.
Published: (2025)
SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training
by: He, Zhongyu, et al.
Published: (2026)
The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks
by: Mayor, Walter, et al.
Published: (2025)
AdaGamma: State-Dependent Discounting for Temporal Adaptation in Reinforcement Learning
by: Wang, Yaomin, et al.
Published: (2026)
FORLER: Federated Offline Reinforcement Learning with Q-Ensemble and Actor Rectification
by: Qiao, Nan, et al.
Published: (2026)
Dependable Distributed Training of Compressed Machine Learning Models
by: Malandrino, Francesco, et al.
Published: (2024)
HyperEyes: Dual-Grained Efficiency-Aware Reinforcement Learning for Parallel Multimodal Search Agents
by: Li, Guankai, et al.
Published: (2026)
Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems
by: Abadi, Alireza Saleh, et al.
Published: (2025)
From Observations to Events: Event-Aware World Model for Reinforcement Learning
by: Peng, Zhao-Han, et al.
Published: (2026)
HLS-Seek: QoR-Aware Code Generation for High-Level Synthesis via Proxy Comparative Reward Reinforcement Learning
by: Zou, Qingyun, et al.
Published: (2026)
Angel or Devil: Discriminating Hard Samples and Anomaly Contaminations for Unsupervised Time Series Anomaly Detection
by: Zhang, Ruyi, et al.
Published: (2024)
Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
by: Dou, Jizhe, et al.
Published: (2024)
FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on Staged Reinforcement Learning
by: Chen, Leiming, et al.
Published: (2023)
Hindsight Credit Assignment for Long-Horizon LLM Agents
by: Tan, Hui-Ze, et al.
Published: (2026)