| Main Authors: | Zhou, Hanlin, Chan, Huah Yong |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01797 |
Similar Items
Runtime Burden Allocation for Structured LLM Routing in Agentic Expert Systems: A Full-Factorial Cross-Backend Methodology
by: Hanlin, Zhou, et al.
Published: (2026)
ADEMA: A Knowledge-State Orchestration Architecture for Long-Horizon Knowledge Synthesis with LLM Agents
by: Hanlin, Zhou, et al.
Published: (2026)
Automatic Adjustment of HPA Parameters and Attack Prevention in Kubernetes Using Random Forests
by: Zhou, Hanlin, et al.
Published: (2026)
Position: agentic AI orchestration should be Bayes-consistent
by: Papamarkou, Theodore, et al.
Published: (2026)
Dynamic fairness-aware recommendation through multi-agent social choice
by: Aird, Amanda, et al.
Published: (2023)
MARS: toward more efficient multi-agent collaboration for LLM reasoning
by: Wang, Xiao, et al.
Published: (2025)
Evidence-based diagnostic reasoning with multi-agent copilot for human pathology
by: Weishaupt, Luca L., et al.
Published: (2025)
TrajOnco: a multi-agent framework for temporal reasoning over longitudinal EHR for multi-cancer early detection
by: Zeng, Sihang, et al.
Published: (2026)
Improving monotonic optimization in heterogeneous multi-agent reinforcement learning with optimal marginal deterministic policy gradient
by: Yu, Xiaoyang, et al.
Published: (2025)
EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL
by: Zhang, Lunjun, et al.
Published: (2026)
Learning for routing: A guided review of recent developments and future directions
by: Zhou, Fangting, et al.
Published: (2025)
AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery
by: Yin, Yuqi, et al.
Published: (2025)
Adaptive routing protocols for determining optimal paths in AI multi-agent systems: a priority- and learning-enhanced approach
by: Panayotov, Theodor, et al.
Published: (2025)
SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning
by: Adarsh, Shivam, et al.
Published: (2024)
Automated legal reasoning with discretion to act using s(LAW)
by: Arias, Joaquín, et al.
Published: (2024)
WiseMind: a knowledge-guided multi-agent framework for accurate and empathetic psychiatric diagnosis
by: Wu, Yuqi, et al.
Published: (2025)
Biomedical reasoning in action: Multi-agent System for Auditable Biomedical Evidence Synthesis
by: Wysocki, Oskar, et al.
Published: (2025)
Learning to reason about rare diseases through retrieval-augmented agents
by: Kim, Ha Young, et al.
Published: (2025)
SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning
by: Ghafarollahi, Alireza, et al.
Published: (2024)
Reinforcement Learning in hyperbolic space for multi-step reasoning
by: Xu, Tao, et al.
Published: (2025)
DecompSR: A dataset for decomposed analyses of compositional multihop spatial reasoning
by: McPheat, Lachlan, et al.
Published: (2025)
Retrieval-augmented reasoning with lean language models
by: Chan, Ryan Sze-Yin, et al.
Published: (2025)
EMA Without the Lag: Bias-Corrected Iterate Averaging Schemes
by: Block, Adam, et al.
Published: (2025)
Is continuous CoT better suited for multi-lingual reasoning?
by: Bashir, Ali Hamza, et al.
Published: (2026)
Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
by: Xu, Cong, et al.
Published: (2025)
Incorporating uncertainty quantification into travel mode choice modeling: a Bayesian neural network (BNN) approach and an uncertainty-guided active survey framework
by: Zheng, Shuwen, et al.
Published: (2024)
From LLM-anation to LLM-orchestrator: Coordinating Small Models for Data Labeling
by: Lu, Yao, et al.
Published: (2025)
The cognitive companion: a lightweight parallel monitoring architecture for detecting and recovering from reasoning degradation in LLM agents
by: Khan, Rafflesia, et al.
Published: (2026)
Configurable multi-agent framework for scalable and realistic testing of llm-based agents
by: Wang, Sai, et al.
Published: (2025)
Explore Theory of Mind: Program-guided adversarial data generation for theory of mind reasoning
by: Sclar, Melanie, et al.
Published: (2024)
Performance of AI agents based on reasoning language models on ALD process optimization tasks
by: Yanguas-Gil, Angel
Published: (2026)
A Kubernetes custom scheduler based on reinforcement learning for compute-intensive pods
by: Zhou, Hanlin, et al.
Published: (2026)
Performance Comparison of IBN orchestration using LLM and SLMs
by: Phone, Wai Lwin, et al.
Published: (2026)
MetaOpenFOAM: an LLM-based multi-agent framework for CFD
by: Chen, Yuxuan, et al.
Published: (2024)
Causal vs. Anticausal merging of predictors
by: Mejia, Sergio Hernan Garrido, et al.
Published: (2025)
Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably
by: Kang, Enoch Hyunwook
Published: (2026)
Adaptive parameter sharing for multi-agent reinforcement learning
by: Li, Dapeng, et al.
Published: (2023)
Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent
by: Nusrat, Humza, et al.
Published: (2025)
PathReasoning: A multimodal reasoning agent for query-based ROI navigation on whole-slide images
by: Zhang, Kunpeng, et al.
Published: (2025)
Using multi-agent architecture to mitigate the risk of LLM hallucinations
by: Amer, Abd Elrahman, et al.
Published: (2025)