:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhu, Siyuan, Xu, Chengdong, Ke, Kaiqiang, Yu, Chao
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2512.14465
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

H$^2$R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents
by: Ye, Shicheng, et al.
Published: (2025)

EvoMAS: Learning Execution-Time Workflows for Multi-Agent Systems
by: Xu, Chengdong, et al.
Published: (2026)

Federated reinforcement learning for robot motion planning with zero-shot generalization
by: Yuan, Zhenyuan, et al.
Published: (2024)

Dynamic feature selection in medical predictive monitoring by reinforcement learning
by: Chen, Yutong, et al.
Published: (2024)

Adaptive parameter sharing for multi-agent reinforcement learning
by: Li, Dapeng, et al.
Published: (2023)

Curriculum reinforcement learning with measurable task representation learning
by: Wen, Yongyan, et al.
Published: (2026)

Economic span selection of bridge based on deep reinforcement learning
by: Zhang, Leye, et al.
Published: (2024)

On the consistency of hyper-parameter selection in value-based deep reinforcement learning
by: Obando-Ceron, Johan, et al.
Published: (2024)

SpeContext: Enabling Efficient Long-context Reasoning with Speculative Context Sparsity in LLMs
by: Xu, Jiaming, et al.
Published: (2025)

Counterfactual experience augmented off-policy reinforcement learning
by: Lee, Sunbowen, et al.
Published: (2025)

The challenge of hidden gifts in multi-agent reinforcement learning
by: Malenfant, Dane, et al.
Published: (2025)

The impact of behavioral diversity in multi-agent reinforcement learning
by: Bettini, Matteo, et al.
Published: (2024)

An exploration for higher efficiency in multi objective optimisation with reinforcement learning
by: Aydin, Mehmet Emin
Published: (2025)

Improving monotonic optimization in heterogeneous multi-agent reinforcement learning with optimal marginal deterministic policy gradient
by: Yu, Xiaoyang, et al.
Published: (2025)

Designing a double deep reinforcement learning selection tool for resilient demand prediction
by: Benziane, Bilel Abderrahmane, et al.
Published: (2026)

Contrastive learning-based agent modeling for deep reinforcement learning
by: Ma, Wenhao, et al.
Published: (2023)

ASLSL: Adaptive shared latent structure learning with incomplete multi-modal physiological data for multi-dimensional emotional feature selection
by: Xu, Xueyuan, et al.
Published: (2025)

Data-driven simulator of multi-animal behavior with unknown dynamics via offline and online reinforcement learning
by: Fujii, Keisuke, et al.
Published: (2025)

Context selectivity with dynamic availability enables lifelong continual learning
by: Barry, Martin, et al.
Published: (2023)

Modelling crypto markets by multi-agent reinforcement learning
by: Lussange, Johann, et al.
Published: (2024)

Bridging the phenotype-target gap for molecular generation via multi-objective reinforcement learning
by: Guo, Haotian, et al.
Published: (2025)

ADSEL: Adaptive dual self-expression learning for EEG feature selection via incomplete multi-dimensional emotional tagging
by: Yu, Tianze, et al.
Published: (2025)

Retrieval-augmented in-context learning for multimodal large language models in disease classification
by: Zhan, Zaifu, et al.
Published: (2025)

Soft $Q(λ)$: A multi-step off-policy method for entropy regularised reinforcement learning using eligibility traces
by: Mahajan, Pranav, et al.
Published: (2026)

Self-training superconducting neuromorphic circuits using reinforcement learning rules
by: Schneider, M. L., et al.
Published: (2024)

Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach
by: Mirbakhsh, Shahin, et al.
Published: (2024)

Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)

Advancing network resilience theories with symbolized reinforcement learning
by: Zheng, Yu, et al.
Published: (2025)

Complementing reinforcement learning with SFT through logit averaging in the post training of LLMs
by: Gan, Xingwei, et al.
Published: (2026)

Knowledge acquisition for dialogue agents using reinforcement learning on graph representations
by: Santamaria, Selene Baez, et al.
Published: (2024)

Causal prompting model-based offline reinforcement learning
by: Yu, Xuehui, et al.
Published: (2024)

Deep Reinforcement Learning for Picker Routing Problem in Warehousing
by: Dunn, George, et al.
Published: (2024)

Analyzing limits for in-context learning
by: Naim, Omar, et al.
Published: (2025)

In-context learning and Occam's razor
by: Elmoznino, Eric, et al.
Published: (2024)

Not all tokens are needed(NAT): token efficient reinforcement learning
by: Sang, Hejian, et al.
Published: (2026)

Counterfactual reasoning: an analysis of in-context emergence
by: Miller, Moritz, et al.
Published: (2025)

Multi-Objective Constraint Inference using Inverse reinforcement learning
by: Shah, Syed Ihtesham Hussain, et al.
Published: (2026)

Analyzing sequential activity and travel decisions with interpretable deep inverse reinforcement learning
by: Liang, Yuebing, et al.
Published: (2025)

Deep reinforcement learning for irrigation scheduling using high-dimensional sensor feedback
by: Saikai, Yuji, et al.
Published: (2023)

ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents
by: Wu, Yong, et al.
Published: (2026)