Saved in:
| Main Authors: | Ma, Guoqing, Zhang, Yuhan, Dai, Yuming, Hao, Guangfu, Chen, Yang, Yu, Shan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.11607 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Orthogonal Weight Modification Enhances Learning Scalability and Convergence Efficiency without Gradient Backpropagation
by: Ma, Guoqing, et al.
Published: (2026)
by: Ma, Guoqing, et al.
Published: (2026)
Brain-Inspired Graph Multi-Agent Systems for LLM Reasoning
by: Hao, Guangfu, et al.
Published: (2026)
by: Hao, Guangfu, et al.
Published: (2026)
Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization
by: Hao, Ruijie, et al.
Published: (2026)
by: Hao, Ruijie, et al.
Published: (2026)
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
by: Hu, Hao, et al.
Published: (2025)
by: Hu, Hao, et al.
Published: (2025)
Weight Clipping for Deep Continual and Reinforcement Learning
by: Elsayed, Mohamed, et al.
Published: (2024)
by: Elsayed, Mohamed, et al.
Published: (2024)
Deep Clustering of Tabular Data by Weighted Gaussian Distribution Learning
by: Rabbani, Shourav B., et al.
Published: (2023)
by: Rabbani, Shourav B., et al.
Published: (2023)
DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management
by: Xie, Yaqi, et al.
Published: (2026)
by: Xie, Yaqi, et al.
Published: (2026)
FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on Staged Reinforcement Learning
by: Chen, Leiming, et al.
Published: (2023)
by: Chen, Leiming, et al.
Published: (2023)
Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning
by: Lan, Jiahua, et al.
Published: (2025)
by: Lan, Jiahua, et al.
Published: (2025)
Research on Optimization of Natural Language Processing Model Based on Multimodal Deep Learning
by: Sun, Dan, et al.
Published: (2024)
by: Sun, Dan, et al.
Published: (2024)
Multiplicative Orthogonal Sequential Editing for Language Models
by: Xu, Hao-Xiang, et al.
Published: (2026)
by: Xu, Hao-Xiang, et al.
Published: (2026)
Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning
by: Ma, Oubo, et al.
Published: (2026)
by: Ma, Oubo, et al.
Published: (2026)
Representation Learning Enhanced Deep Reinforcement Learning for Optimal Operation of Hydrogen-based Multi-Energy Systems
by: Pu, Zhenyu, et al.
Published: (2026)
by: Pu, Zhenyu, et al.
Published: (2026)
Deep Matrix Factorization with Adaptive Weights for Multi-View Clustering
by: Khalafaoui, Yasser, et al.
Published: (2024)
by: Khalafaoui, Yasser, et al.
Published: (2024)
Multiobjective Hydropower Reservoir Operation Optimization with Transformer-Based Deep Reinforcement Learning
by: Wu, Rixin, et al.
Published: (2023)
by: Wu, Rixin, et al.
Published: (2023)
Deep Orthogonal Hypersphere Compression for Anomaly Detection
by: Zhang, Yunhe, et al.
Published: (2023)
by: Zhang, Yunhe, et al.
Published: (2023)
Reward Models in Deep Reinforcement Learning: A Survey
by: Yu, Rui, et al.
Published: (2025)
by: Yu, Rui, et al.
Published: (2025)
Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner
by: Ma, Hao, et al.
Published: (2026)
by: Ma, Hao, et al.
Published: (2026)
A Practical Introduction to Deep Reinforcement Learning
by: Sun, Yinghan, et al.
Published: (2025)
by: Sun, Yinghan, et al.
Published: (2025)
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
by: Zheng, Chujie, et al.
Published: (2025)
by: Zheng, Chujie, et al.
Published: (2025)
Multi-order Graph Clustering with Adaptive Node-level Weight Learning
by: Liu, Ye, et al.
Published: (2024)
by: Liu, Ye, et al.
Published: (2024)
UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning
by: Ma, Oubo, et al.
Published: (2025)
by: Ma, Oubo, et al.
Published: (2025)
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
by: Xu, Haoran, et al.
Published: (2025)
by: Xu, Haoran, et al.
Published: (2025)
QSIM: Mitigating Overestimation in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning
by: Li, Yuanjun, et al.
Published: (2026)
by: Li, Yuanjun, et al.
Published: (2026)
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)
by: Yuan, Mingqi, et al.
Published: (2025)
Federated Incomplete Multi-view Clustering with Globally Fused Graph Guidance
by: Chao, Guoqing, et al.
Published: (2025)
by: Chao, Guoqing, et al.
Published: (2025)
Preconditioning Benefits of Spectral Orthogonalization in Muon
by: Ma, Jianhao, et al.
Published: (2026)
by: Ma, Jianhao, et al.
Published: (2026)
Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating
by: Yanggong, Yifan, et al.
Published: (2024)
by: Yanggong, Yifan, et al.
Published: (2024)
Bit-Identical Medical Deep Learning via Structured Orthogonal Initialization
by: Shkolnikov, Yakov Pyotr
Published: (2026)
by: Shkolnikov, Yakov Pyotr
Published: (2026)
Stabilizing Reinforcement Learning for Diffusion Language Models
by: Zhong, Jianyuan, et al.
Published: (2026)
by: Zhong, Jianyuan, et al.
Published: (2026)
A Survey on Explainable Deep Reinforcement Learning
by: Cheng, Zelei, et al.
Published: (2025)
by: Cheng, Zelei, et al.
Published: (2025)
Flow-Based Policy for Online Reinforcement Learning
by: Lv, Lei, et al.
Published: (2025)
by: Lv, Lei, et al.
Published: (2025)
TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
by: Li, Yuxuan, et al.
Published: (2025)
by: Li, Yuxuan, et al.
Published: (2025)
Approximated Orthogonal Projection Unit: Stabilizing Regression Network Training Using Natural Gradient
by: Wang, Shaoqi, et al.
Published: (2024)
by: Wang, Shaoqi, et al.
Published: (2024)
Deep Contrastive Graph Learning with Clustering-Oriented Guidance
by: Chen, Mulin, et al.
Published: (2024)
by: Chen, Mulin, et al.
Published: (2024)
Discovering Behavioral Modes in Deep Reinforcement Learning Policies Using Trajectory Clustering in Latent Space
by: Remman, Sindre Benjamin, et al.
Published: (2024)
by: Remman, Sindre Benjamin, et al.
Published: (2024)
Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
by: Yang, Zixuan, et al.
Published: (2024)
by: Yang, Zixuan, et al.
Published: (2024)
StaRPO: Stability-Augmented Reinforcement Policy Optimization
by: Zhang, Jinghan, et al.
Published: (2026)
by: Zhang, Jinghan, et al.
Published: (2026)
Enhancing Deep Learning with Optimized Gradient Descent: Bridging Numerical Methods and Neural Network Training
by: Ma, Yuhan, et al.
Published: (2024)
by: Ma, Yuhan, et al.
Published: (2024)
Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning
by: Ma, Hao, et al.
Published: (2025)
by: Ma, Hao, et al.
Published: (2025)
Similar Items
-
Orthogonal Weight Modification Enhances Learning Scalability and Convergence Efficiency without Gradient Backpropagation
by: Ma, Guoqing, et al.
Published: (2026) -
Brain-Inspired Graph Multi-Agent Systems for LLM Reasoning
by: Hao, Guangfu, et al.
Published: (2026) -
Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization
by: Hao, Ruijie, et al.
Published: (2026) -
Policy-Based Trajectory Clustering in Offline Reinforcement Learning
by: Hu, Hao, et al.
Published: (2025) -
Weight Clipping for Deep Continual and Reinforcement Learning
by: Elsayed, Mohamed, et al.
Published: (2024)