| Main Authors: | Healy, Kait; Srinivasan, Bharathi; Madathil, Visakh; Wu, Jing |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.05214 |
Similar Items
A Methodology for Selecting and Composing Runtime Architecture Patterns for Production LLM Agents
by: Srinivasan, Vasundra
Published: (2026)
UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents
by: Liang, Yijuan, et al.
Published: (2026)
Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection
by: Mudunuri, Sarat, et al.
Published: (2026)
Hallucination Detection with the Internal Layers of LLMs
by: Preiß, Martin
Published: (2025)
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation
by: Wan, Yixin, et al.
Published: (2023)
AutoTool: Efficient Tool Selection for Large Language Model Agents
by: Jia, Jingyi, et al.
Published: (2025)
Stateless Decision Memory for Enterprise AI Agents
by: Srinivasan, Vasundra
Published: (2026)
ToolTweak: An Attack on Tool Selection in LLM-based Agents
by: Sneh, Jonathan, et al.
Published: (2025)
MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools
by: Subramani, Nishant, et al.
Published: (2025)
Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models
by: Su, Weihang, et al.
Published: (2024)
Do We Really Need External Tools to Mitigate Hallucinations? SIRA: Shared-Prefix Internal Reconstruction of Attribution
by: Qin, Tian, et al.
Published: (2026)
ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models
by: Zhang, Yuxiang, et al.
Published: (2024)
Spectral Guardrails for Agents in the Wild: Detecting Tool Use Hallucinations via Attention Topology
by: Noël, Valentin
Published: (2026)
IRIS: Implicit Reward-Guided Internal Sifting for Mitigating Multimodal Hallucination
by: Li, Yuanshuai, et al.
Published: (2026)
On the Internal Representations of Graph Metanetworks
by: Yeom, Taesun, et al.
Published: (2025)
SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
by: Ding, Keyan, et al.
Published: (2025)
ASA: Training-Free Representation Engineering for Tool-Calling Agents
by: Wang, Youjin, et al.
Published: (2026)
Modality-Native Routing in Agent-to-Agent Networks: A Multimodal A2A Protocol Extension
by: Srinivasan, Vasundra
Published: (2026)
The Tool-Overuse Illusion: Why Does LLM Prefer External Tools over Internal Knowledge?
by: Zeng, Yirong, et al.
Published: (2026)
LLM Safety From Within: Detecting Harmful Content with Internal Representations
by: Jiao, Difan, et al.
Published: (2026)
Causal Probing for Internal Visual Representations in Multimodal Large Language Models
by: Deng, Zehao, et al.
Published: (2026)
Affinity and Diversity: A Unified Metric for Demonstration Selection via Internal Representations
by: Kato, Mariko, et al.
Published: (2025)
ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents
by: Hu, Xuhao, et al.
Published: (2026)
Visualizing and Benchmarking LLM Factual Hallucination Tendencies via Internal State Analysis and Clustering
by: Mao, Nathan, et al.
Published: (2026)
Probing LLM Hallucination from Within: Perturbation-Driven Approach via Internal Knowledge
by: Lee, Seongmin, et al.
Published: (2024)
DART: Semantic Recoverability for Structured Tool Agents
by: Yang, Ke, et al.
Published: (2026)
Bridging Protocol and Production: Design Patterns for Deploying AI Agents with Model Context Protocol
by: Srinivasan, Vasundra
Published: (2026)
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation
by: Chen, Shiqi, et al.
Published: (2024)
EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection
by: Yang, Shuo, et al.
Published: (2026)
CORVUS: Red-Teaming Hallucination Detectors via Internal Signal Camouflage in Large Language Models
by: Min, Nay Myat, et al.
Published: (2026)
Hallucination Detection and Hallucination Mitigation: An Investigation
by: Luo, Junliang, et al.
Published: (2024)
Multimodal Policy Internalization for Conversational Agents
by: Wang, Zhenhailong, et al.
Published: (2025)
Analyzing and Internalizing Complex Policy Documents for LLM Agents
by: Liu, Jiateng, et al.
Published: (2025)
Latent Agents: A Post-Training Procedure for Internalized Multi-Agent Debate
by: Yi, John Seon Keun, et al.
Published: (2026)
Hallucination as Exploit: Evidence-Carrying Multimodal Agents
by: Zhang, Guijia, et al.
Published: (2026)
HalluClear: Diagnosing, Evaluating and Mitigating Hallucinations in GUI Agents
by: Jin, Chao, et al.
Published: (2026)
MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them
by: Zhang, Weichen, et al.
Published: (2025)
On LLMs' Internal Representation of Code Correctness
by: Ribeiro, Francisco, et al.
Published: (2025)
Evo-MARL: Co-Evolutionary Multi-Agent Reinforcement Learning for Internalized Safety
by: Pan, Zhenyu, et al.
Published: (2025)
Co-Evolution of Policy and Internal Reward for Language Agents
by: Wang, Xinyu, et al.
Published: (2026)