:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	He, Zhouyu, Qiao, Peng, Li, Rongchun, Dou, Yong, Tan, Yusong
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.20190
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Generative Pre-Trained Transformer for Symbolic Regression Base In-Context Reinforcement Learning
by: Li, Yanjie, et al.
Published: (2024)

Acceleration for Deep Reinforcement Learning using Parallel and Distributed Computing: A Survey
by: Liu, Zhihong, et al.
Published: (2024)

DAWM: Diffusion Action World Models for Offline Reinforcement Learning via Action-Inferred Transitions
by: Li, Zongyue, et al.
Published: (2025)

Transolver is a Linear Transformer: Revisiting Physics-Attention through the Lens of Linear Attention
by: Hu, Wenjie, et al.
Published: (2025)

Search-Based Credit Assignment for Offline Preference-Based Reinforcement Learning
by: Gao, Xiancheng, et al.
Published: (2025)

Pinpointing crucial steps: Attribution-based Credit Assignment for Verifiable Reinforcement Learning
by: Yin, Junxi, et al.
Published: (2025)

HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents
by: Peng, Jiangweizhi, et al.
Published: (2026)

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning
by: Ramesh, Aditya A., et al.
Published: (2024)

A Survey of Temporal Credit Assignment in Deep Reinforcement Learning
by: Pignatelli, Eduardo, et al.
Published: (2023)

Learning Solution-Aware Transformers for Efficiently Solving Quadratic Assignment Problem
by: Tan, Zhentao, et al.
Published: (2024)

SOLAR-RL: Semi-Online Long-horizon Assignment Reinforcement Learning
by: Wang, Jichao, et al.
Published: (2026)

Latent Reward: LLM-Empowered Credit Assignment in Episodic Reinforcement Learning
by: Qu, Yun, et al.
Published: (2024)

Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement Learning
by: R, Shreyas S
Published: (2024)

NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment
by: Koduri, Harsha
Published: (2025)

LLM-Guided Reinforcement Learning: Addressing Training Bottlenecks through Policy Modulation
by: Tan, Heng, et al.
Published: (2025)

Agent Lightning: Train ANY AI Agents with Reinforcement Learning
by: Luo, Xufang, et al.
Published: (2025)

Reinforcement Learning for Scalable Train Timetable Rescheduling with Graph Representation
by: Yue, Peng, et al.
Published: (2024)

Evaluating Feature Dependent Noise in Preference-based Reinforcement Learning
by: Li, Yuxuan, et al.
Published: (2026)

Unsupervised Learning for Quadratic Assignment
by: Min, Yimeng, et al.
Published: (2025)

Rumor Detection on Social Media with Reinforcement Learning-based Key Propagation Graph Generator
by: Zhang, Yusong, et al.
Published: (2024)

Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning
by: Koyamada, Sotetsu, et al.
Published: (2023)

Scaling Behaviors of LLM Reinforcement Learning Post-Training: An Empirical Study in Mathematical Reasoning
by: Tan, Zelin, et al.
Published: (2025)

Controllable Flow Matching for Online Reinforcement Learning
by: Wang, Bin, et al.
Published: (2025)

ParaDySe: A Parallel-Strategy Switching Framework for Dynamic Sequence Lengths in Transformer
by: Ou, Zhixin, et al.
Published: (2025)

Energy Consumption in Parallel Neural Network Training
by: Huber, Philipp, et al.
Published: (2025)

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2024)

RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation
by: Yin, Xiaolong, et al.
Published: (2025)

SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training
by: He, Zhongyu, et al.
Published: (2026)

The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks
by: Mayor, Walter, et al.
Published: (2025)

AdaGamma: State-Dependent Discounting for Temporal Adaptation in Reinforcement Learning
by: Wang, Yaomin, et al.
Published: (2026)

FORLER: Federated Offline Reinforcement Learning with Q-Ensemble and Actor Rectification
by: Qiao, Nan, et al.
Published: (2026)

Dependable Distributed Training of Compressed Machine Learning Models
by: Malandrino, Francesco, et al.
Published: (2024)

HyperEyes: Dual-Grained Efficiency-Aware Reinforcement Learning for Parallel Multimodal Search Agents
by: Li, Guankai, et al.
Published: (2026)

Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems
by: Abadi, Alireza Saleh, et al.
Published: (2025)

From Observations to Events: Event-Aware World Model for Reinforcement Learning
by: Peng, Zhao-Han, et al.
Published: (2026)

HLS-Seek: QoR-Aware Code Generation for High-Level Synthesis via Proxy Comparative Reward Reinforcement Learning
by: Zou, Qingyun, et al.
Published: (2026)

Angel or Devil: Discriminating Hard Samples and Anomaly Contaminations for Unsupervised Time Series Anomaly Detection
by: Zhang, Ruyi, et al.
Published: (2024)

Scheduling Drone and Mobile Charger via Hybrid-Action Deep Reinforcement Learning
by: Dou, Jizhe, et al.
Published: (2024)

FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on Staged Reinforcement Learning
by: Chen, Leiming, et al.
Published: (2023)

Hindsight Credit Assignment for Long-Horizon LLM Agents
by: Tan, Hui-Ze, et al.
Published: (2026)