Saved in:
| Main Authors: | Zhu, Siyuan, Xu, Chengdong, Ke, Kaiqiang, Yu, Chao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.14465 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
H$^2$R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents
by: Ye, Shicheng, et al.
Published: (2025)
by: Ye, Shicheng, et al.
Published: (2025)
EvoMAS: Learning Execution-Time Workflows for Multi-Agent Systems
by: Xu, Chengdong, et al.
Published: (2026)
by: Xu, Chengdong, et al.
Published: (2026)
Federated reinforcement learning for robot motion planning with zero-shot generalization
by: Yuan, Zhenyuan, et al.
Published: (2024)
by: Yuan, Zhenyuan, et al.
Published: (2024)
Dynamic feature selection in medical predictive monitoring by reinforcement learning
by: Chen, Yutong, et al.
Published: (2024)
by: Chen, Yutong, et al.
Published: (2024)
Adaptive parameter sharing for multi-agent reinforcement learning
by: Li, Dapeng, et al.
Published: (2023)
by: Li, Dapeng, et al.
Published: (2023)
Curriculum reinforcement learning with measurable task representation learning
by: Wen, Yongyan, et al.
Published: (2026)
by: Wen, Yongyan, et al.
Published: (2026)
Economic span selection of bridge based on deep reinforcement learning
by: Zhang, Leye, et al.
Published: (2024)
by: Zhang, Leye, et al.
Published: (2024)
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
by: Obando-Ceron, Johan, et al.
Published: (2024)
by: Obando-Ceron, Johan, et al.
Published: (2024)
SpeContext: Enabling Efficient Long-context Reasoning with Speculative Context Sparsity in LLMs
by: Xu, Jiaming, et al.
Published: (2025)
by: Xu, Jiaming, et al.
Published: (2025)
Counterfactual experience augmented off-policy reinforcement learning
by: Lee, Sunbowen, et al.
Published: (2025)
by: Lee, Sunbowen, et al.
Published: (2025)
The challenge of hidden gifts in multi-agent reinforcement learning
by: Malenfant, Dane, et al.
Published: (2025)
by: Malenfant, Dane, et al.
Published: (2025)
The impact of behavioral diversity in multi-agent reinforcement learning
by: Bettini, Matteo, et al.
Published: (2024)
by: Bettini, Matteo, et al.
Published: (2024)
An exploration for higher efficiency in multi objective optimisation with reinforcement learning
by: Aydin, Mehmet Emin
Published: (2025)
by: Aydin, Mehmet Emin
Published: (2025)
Improving monotonic optimization in heterogeneous multi-agent reinforcement learning with optimal marginal deterministic policy gradient
by: Yu, Xiaoyang, et al.
Published: (2025)
by: Yu, Xiaoyang, et al.
Published: (2025)
Designing a double deep reinforcement learning selection tool for resilient demand prediction
by: Benziane, Bilel Abderrahmane, et al.
Published: (2026)
by: Benziane, Bilel Abderrahmane, et al.
Published: (2026)
Contrastive learning-based agent modeling for deep reinforcement learning
by: Ma, Wenhao, et al.
Published: (2023)
by: Ma, Wenhao, et al.
Published: (2023)
ASLSL: Adaptive shared latent structure learning with incomplete multi-modal physiological data for multi-dimensional emotional feature selection
by: Xu, Xueyuan, et al.
Published: (2025)
by: Xu, Xueyuan, et al.
Published: (2025)
Data-driven simulator of multi-animal behavior with unknown dynamics via offline and online reinforcement learning
by: Fujii, Keisuke, et al.
Published: (2025)
by: Fujii, Keisuke, et al.
Published: (2025)
Context selectivity with dynamic availability enables lifelong continual learning
by: Barry, Martin, et al.
Published: (2023)
by: Barry, Martin, et al.
Published: (2023)
Modelling crypto markets by multi-agent reinforcement learning
by: Lussange, Johann, et al.
Published: (2024)
by: Lussange, Johann, et al.
Published: (2024)
Bridging the phenotype-target gap for molecular generation via multi-objective reinforcement learning
by: Guo, Haotian, et al.
Published: (2025)
by: Guo, Haotian, et al.
Published: (2025)
ADSEL: Adaptive dual self-expression learning for EEG feature selection via incomplete multi-dimensional emotional tagging
by: Yu, Tianze, et al.
Published: (2025)
by: Yu, Tianze, et al.
Published: (2025)
Retrieval-augmented in-context learning for multimodal large language models in disease classification
by: Zhan, Zaifu, et al.
Published: (2025)
by: Zhan, Zaifu, et al.
Published: (2025)
Soft $Q(λ)$: A multi-step off-policy method for entropy regularised reinforcement learning using eligibility traces
by: Mahajan, Pranav, et al.
Published: (2026)
by: Mahajan, Pranav, et al.
Published: (2026)
Self-training superconducting neuromorphic circuits using reinforcement learning rules
by: Schneider, M. L., et al.
Published: (2024)
by: Schneider, M. L., et al.
Published: (2024)
Adaptive traffic signal safety and efficiency improvement by multi objective deep reinforcement learning approach
by: Mirbakhsh, Shahin, et al.
Published: (2024)
by: Mirbakhsh, Shahin, et al.
Published: (2024)
Normalization and effective learning rates in reinforcement learning
by: Lyle, Clare, et al.
Published: (2024)
by: Lyle, Clare, et al.
Published: (2024)
Advancing network resilience theories with symbolized reinforcement learning
by: Zheng, Yu, et al.
Published: (2025)
by: Zheng, Yu, et al.
Published: (2025)
Complementing reinforcement learning with SFT through logit averaging in the post training of LLMs
by: Gan, Xingwei, et al.
Published: (2026)
by: Gan, Xingwei, et al.
Published: (2026)
Knowledge acquisition for dialogue agents using reinforcement learning on graph representations
by: Santamaria, Selene Baez, et al.
Published: (2024)
by: Santamaria, Selene Baez, et al.
Published: (2024)
Causal prompting model-based offline reinforcement learning
by: Yu, Xuehui, et al.
Published: (2024)
by: Yu, Xuehui, et al.
Published: (2024)
Deep Reinforcement Learning for Picker Routing Problem in Warehousing
by: Dunn, George, et al.
Published: (2024)
by: Dunn, George, et al.
Published: (2024)
Analyzing limits for in-context learning
by: Naim, Omar, et al.
Published: (2025)
by: Naim, Omar, et al.
Published: (2025)
In-context learning and Occam's razor
by: Elmoznino, Eric, et al.
Published: (2024)
by: Elmoznino, Eric, et al.
Published: (2024)
Not all tokens are needed(NAT): token efficient reinforcement learning
by: Sang, Hejian, et al.
Published: (2026)
by: Sang, Hejian, et al.
Published: (2026)
Counterfactual reasoning: an analysis of in-context emergence
by: Miller, Moritz, et al.
Published: (2025)
by: Miller, Moritz, et al.
Published: (2025)
Multi-Objective Constraint Inference using Inverse reinforcement learning
by: Shah, Syed Ihtesham Hussain, et al.
Published: (2026)
by: Shah, Syed Ihtesham Hussain, et al.
Published: (2026)
Analyzing sequential activity and travel decisions with interpretable deep inverse reinforcement learning
by: Liang, Yuebing, et al.
Published: (2025)
by: Liang, Yuebing, et al.
Published: (2025)
Deep reinforcement learning for irrigation scheduling using high-dimensional sensor feedback
by: Saikai, Yuji, et al.
Published: (2023)
by: Saikai, Yuji, et al.
Published: (2023)
ContextBudget: Budget-Aware Context Management for Long-Horizon Search Agents
by: Wu, Yong, et al.
Published: (2026)
by: Wu, Yong, et al.
Published: (2026)
Similar Items
-
H$^2$R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents
by: Ye, Shicheng, et al.
Published: (2025) -
EvoMAS: Learning Execution-Time Workflows for Multi-Agent Systems
by: Xu, Chengdong, et al.
Published: (2026) -
Federated reinforcement learning for robot motion planning with zero-shot generalization
by: Yuan, Zhenyuan, et al.
Published: (2024) -
Dynamic feature selection in medical predictive monitoring by reinforcement learning
by: Chen, Yutong, et al.
Published: (2024) -
Adaptive parameter sharing for multi-agent reinforcement learning
by: Li, Dapeng, et al.
Published: (2023)