Library Catalog

Bibliographic Details
Main Authors:	Zhou, Hanlin; Chan, Huah Yong
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.01797

Similar Items

Runtime Burden Allocation for Structured LLM Routing in Agentic Expert Systems: A Full-Factorial Cross-Backend Methodology
by: Zhou, Hanlin, et al.
Published: (2026)

ADEMA: A Knowledge-State Orchestration Architecture for Long-Horizon Knowledge Synthesis with LLM Agents
by: Zhou, Hanlin, et al.
Published: (2026)

Automatic Adjustment of HPA Parameters and Attack Prevention in Kubernetes Using Random Forests
by: Zhou, Hanlin, et al.
Published: (2026)

Position: agentic AI orchestration should be Bayes-consistent
by: Papamarkou, Theodore, et al.
Published: (2026)

Dynamic fairness-aware recommendation through multi-agent social choice
by: Aird, Amanda, et al.
Published: (2023)

MARS: toward more efficient multi-agent collaboration for LLM reasoning
by: Wang, Xiao, et al.
Published: (2025)

Evidence-based diagnostic reasoning with multi-agent copilot for human pathology
by: Weishaupt, Luca L., et al.
Published: (2025)

TrajOnco: a multi-agent framework for temporal reasoning over longitudinal EHR for multi-cancer early detection
by: Zeng, Sihang, et al.
Published: (2026)

Improving monotonic optimization in heterogeneous multi-agent reinforcement learning with optimal marginal deterministic policy gradient
by: Yu, Xiaoyang, et al.
Published: (2025)

EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL
by: Zhang, Lunjun, et al.
Published: (2026)

Learning for routing: A guided review of recent developments and future directions
by: Zhou, Fangting, et al.
Published: (2025)

AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery
by: Yin, Yuqi, et al.
Published: (2025)

Adaptive routing protocols for determining optimal paths in AI multi-agent systems: a priority- and learning-enhanced approach
by: Panayotov, Theodor, et al.
Published: (2025)

SIKeD: Self-guided Iterative Knowledge Distillation for mathematical reasoning
by: Adarsh, Shivam, et al.
Published: (2024)

Automated legal reasoning with discretion to act using s(LAW)
by: Arias, Joaquín, et al.
Published: (2024)

WiseMind: a knowledge-guided multi-agent framework for accurate and empathetic psychiatric diagnosis
by: Wu, Yuqi, et al.
Published: (2025)

Biomedical reasoning in action: Multi-agent System for Auditable Biomedical Evidence Synthesis
by: Wysocki, Oskar, et al.
Published: (2025)

Learning to reason about rare diseases through retrieval-augmented agents
by: Kim, Ha Young, et al.
Published: (2025)

SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning
by: Ghafarollahi, Alireza, et al.
Published: (2024)

Reinforcement Learning in hyperbolic space for multi-step reasoning
by: Xu, Tao, et al.
Published: (2025)

DecompSR: A dataset for decomposed analyses of compositional multihop spatial reasoning
by: McPheat, Lachlan, et al.
Published: (2025)

Retrieval-augmented reasoning with lean language models
by: Chan, Ryan Sze-Yin, et al.
Published: (2025)

EMA Without the Lag: Bias-Corrected Iterate Averaging Schemes
by: Block, Adam, et al.
Published: (2025)

Is continuous CoT better suited for multi-lingual reasoning?
by: Bashir, Ali Hamza, et al.
Published: (2026)

Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
by: Xu, Cong, et al.
Published: (2025)

Incorporating uncertainty quantification into travel mode choice modeling: a Bayesian neural network (BNN) approach and an uncertainty-guided active survey framework
by: Zheng, Shuwen, et al.
Published: (2024)

From LLM-anation to LLM-orchestrator: Coordinating Small Models for Data Labeling
by: Lu, Yao, et al.
Published: (2025)

The cognitive companion: a lightweight parallel monitoring architecture for detecting and recovering from reasoning degradation in LLM agents
by: Khan, Rafflesia, et al.
Published: (2026)

Configurable multi-agent framework for scalable and realistic testing of llm-based agents
by: Wang, Sai, et al.
Published: (2025)

Explore Theory of Mind: Program-guided adversarial data generation for theory of mind reasoning
by: Sclar, Melanie, et al.
Published: (2024)

Performance of AI agents based on reasoning language models on ALD process optimization tasks
by: Yanguas-Gil, Angel
Published: (2026)

A Kubernetes custom scheduler based on reinforcement learning for compute-intensive pods
by: Zhou, Hanlin, et al.
Published: (2026)

Performance Comparison of IBN orchestration using LLM and SLMs
by: Phone, Wai Lwin, et al.
Published: (2026)

MetaOpenFOAM: an LLM-based multi-agent framework for CFD
by: Chen, Yuxuan, et al.
Published: (2024)

Causal vs. Anticausal merging of predictors
by: Mejia, Sergio Hernan Garrido, et al.
Published: (2025)

Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably
by: Kang, Enoch Hyunwook
Published: (2026)

Adaptive parameter sharing for multi-agent reinforcement learning
by: Li, Dapeng, et al.
Published: (2023)

Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent
by: Nusrat, Humza, et al.
Published: (2025)

PathReasoning: A multimodal reasoning agent for query-based ROI navigation on whole-slide images
by: Zhang, Kunpeng, et al.
Published: (2025)

Using multi-agent architecture to mitigate the risk of LLM hallucinations
by: Amer, Abd Elrahman, et al.
Published: (2025)