:: Library Catalog

Bibliographic Details
Main Authors:	Ma, Guoqing; Zhang, Yuhan; Dai, Yuming; Hao, Guangfu; Chen, Yang; Yu, Shan
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.11607

Similar Items

Orthogonal Weight Modification Enhances Learning Scalability and Convergence Efficiency without Gradient Backpropagation
by: Ma, Guoqing, et al.
Published: (2026)

Brain-Inspired Graph Multi-Agent Systems for LLM Reasoning
by: Hao, Guangfu, et al.
Published: (2026)

Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization
by: Hao, Ruijie, et al.
Published: (2026)

Policy-Based Trajectory Clustering in Offline Reinforcement Learning
by: Hu, Hao, et al.
Published: (2025)

Weight Clipping for Deep Continual and Reinforcement Learning
by: Elsayed, Mohamed, et al.
Published: (2024)

Deep Clustering of Tabular Data by Weighted Gaussian Distribution Learning
by: Rabbani, Shourav B., et al.
Published: (2023)

DeepStock: Reinforcement Learning with Policy Regularizations for Inventory Management
by: Xie, Yaqi, et al.
Published: (2026)

FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on Staged Reinforcement Learning
by: Chen, Leiming, et al.
Published: (2023)

Neuron-level Balance between Stability and Plasticity in Deep Reinforcement Learning
by: Lan, Jiahua, et al.
Published: (2025)

Research on Optimization of Natural Language Processing Model Based on Multimodal Deep Learning
by: Sun, Dan, et al.
Published: (2024)

Multiplicative Orthogonal Sequential Editing for Language Models
by: Xu, Hao-Xiang, et al.
Published: (2026)

Angel or Demon: Investigating the Plasticity Interventions' Impact on Backdoor Threats in Deep Reinforcement Learning
by: Ma, Oubo, et al.
Published: (2026)

Representation Learning Enhanced Deep Reinforcement Learning for Optimal Operation of Hydrogen-based Multi-Energy Systems
by: Pu, Zhenyu, et al.
Published: (2026)

Deep Matrix Factorization with Adaptive Weights for Multi-View Clustering
by: Khalafaoui, Yasser, et al.
Published: (2024)

Multiobjective Hydropower Reservoir Operation Optimization with Transformer-Based Deep Reinforcement Learning
by: Wu, Rixin, et al.
Published: (2023)

Deep Orthogonal Hypersphere Compression for Anomaly Detection
by: Zhang, Yunhe, et al.
Published: (2023)

Reward Models in Deep Reinforcement Learning: A Survey
by: Yu, Rui, et al.
Published: (2025)

Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner
by: Ma, Hao, et al.
Published: (2026)

A Practical Introduction to Deep Reinforcement Learning
by: Sun, Yinghan, et al.
Published: (2025)

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
by: Zheng, Chujie, et al.
Published: (2025)

Multi-order Graph Clustering with Adaptive Node-level Weight Learning
by: Liu, Ye, et al.
Published: (2024)

UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning
by: Ma, Oubo, et al.
Published: (2025)

An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
by: Xu, Haoran, et al.
Published: (2025)

QSIM: Mitigating Overestimation in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning
by: Li, Yuanjun, et al.
Published: (2026)

Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)

Federated Incomplete Multi-view Clustering with Globally Fused Graph Guidance
by: Chao, Guoqing, et al.
Published: (2025)

Preconditioning Benefits of Spectral Orthogonalization in Muon
by: Ma, Jianhao, et al.
Published: (2026)

Mastering the Game of Guandan with Deep Reinforcement Learning and Behavior Regulating
by: Yanggong, Yifan, et al.
Published: (2024)

Bit-Identical Medical Deep Learning via Structured Orthogonal Initialization
by: Shkolnikov, Yakov Pyotr
Published: (2026)

Stabilizing Reinforcement Learning for Diffusion Language Models
by: Zhong, Jianyuan, et al.
Published: (2026)

A Survey on Explainable Deep Reinforcement Learning
by: Cheng, Zelei, et al.
Published: (2025)

Flow-Based Policy for Online Reinforcement Learning
by: Lv, Lei, et al.
Published: (2025)

TW-CRL: Time-Weighted Contrastive Reward Learning for Efficient Inverse Reinforcement Learning
by: Li, Yuxuan, et al.
Published: (2025)

Approximated Orthogonal Projection Unit: Stabilizing Regression Network Training Using Natural Gradient
by: Wang, Shaoqi, et al.
Published: (2024)

Deep Contrastive Graph Learning with Clustering-Oriented Guidance
by: Chen, Mulin, et al.
Published: (2024)

Discovering Behavioral Modes in Deep Reinforcement Learning Policies Using Trajectory Clustering in Latent Space
by: Remman, Sindre Benjamin, et al.
Published: (2024)

Reinfier and Reintrainer: Verification and Interpretation-Driven Safe Deep Reinforcement Learning Frameworks
by: Yang, Zixuan, et al.
Published: (2024)

StaRPO: Stability-Augmented Reinforcement Policy Optimization
by: Zhang, Jinghan, et al.
Published: (2026)

Enhancing Deep Learning with Optimized Gradient Descent: Bridging Numerical Methods and Neural Network Training
by: Ma, Yuhan, et al.
Published: (2024)

Vision-Based Generic Potential Function for Policy Alignment in Multi-Agent Reinforcement Learning
by: Ma, Hao, et al.
Published: (2025)