| Main Authors: | Koaik, Fatima; Gupta, Aayush; Sheikh, Farahan Raza |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.06111 |
Similar Items
TaxAgent: How Large Language Model Designs Fiscal Policy
by: Wang, Jizhou, et al.
Published: (2025)
Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI
by: Pasupuleti, Vinil, et al.
Published: (2026)
From Idea to CAD: A Language Model-Driven Multi-Agent System for Collaborative Design
by: Ocker, Felix, et al.
Published: (2025)
Knowledge Equivalence in Digital Twins of Intelligent Systems
by: Zhang, Nan, et al.
Published: (2022)
LLM Scalability Risk for Agentic-AI and Model Supply Chain Security
by: Ahi, Kiarash, et al.
Published: (2026)
TwinLoop: Simulation-in-the-Loop Digital Twins for Online Multi-Agent Reinforcement Learning
by: Zhang, Nan, et al.
Published: (2026)
Bounded Autonomy for Enterprise AI: Typed Action Contracts and Consumer-Side Execution
by: Sohail, Sarmad, et al.
Published: (2026)
TinyTroupe: An LLM-powered Multiagent Persona Simulation Toolkit
by: Salem, Paulo, et al.
Published: (2025)
The Stochastic Gap: A Markovian Framework for Pre-Deployment Reliability and Oversight-Cost Auditing in Agentic Artificial Intelligence
by: Pal, Biplab, et al.
Published: (2026)
Rewarding Beliefs, Not Actions: Consistency-Guided Credit Assignment for Long-Horizon Agents
by: Tang, Wenjie, et al.
Published: (2026)
The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane
by: Akidau, Tyler, et al.
Published: (2026)
Research on Security Enhancement Methods for Adversarial Robust Large Language Model Intelligent Agents for Medical Decision-Making Tasks
by: Hu, Saisai
Published: (2026)
The Power of Stories: Narrative Priming Shapes How LLM Agents Collaborate and Compete
by: Großmann, Gerrit, et al.
Published: (2025)
Beyond Benchmark Islands: Toward Representative Trustworthiness Evaluation for Agentic AI
by: Qi, Jinhu, et al.
Published: (2026)
AI Agentic workflows and Enterprise APIs: Adapting API architectures for the age of AI agents
by: Tupe, Vaibhav, et al.
Published: (2025)
Security Attack and Defense Strategies for Autonomous Agent Frameworks: A Layered Review with OpenClaw as a Case Study
by: Xu, Luyao, et al.
Published: (2026)
Governance Architecture for Autonomous Agent Systems: Threats, Framework, and Engineering Practice
by: Ge, Yuxu
Published: (2026)
HBEE: Human Behavioral Entropy Engine -- Pre-Registered Multi-Agent LLM Simulation of Peer-Suspicion-Based Detection Inversion
by: Ferrel, Vickson
Published: (2026)
Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults
by: Usman, Rana Muhammad
Published: (2026)
PARNESS: A Paper Harness for End-to-End Automated Scientific Research with Dynamic Workflows, Full-Text Indexing, and Cross-Run Knowledge Accumulation
by: Wang, Yuchen, et al.
Published: (2026)
Depth-Dependent Indirect Prompt Injection in Tool-Calling ReAct Agents: Injection Depth, Payload Framing, and Turn-Budget Sensitivity
by: Rashidi, Mohammadreza
Published: (2026)
Go Big or Go Home: Simulating Mobbing Behavior with Braitenbergian Robots
by: Sanoubari, Elaheh
Published: (2026)
Robust and Diverse Multi-Agent Learning via Rational Policy Gradient
by: Lauffer, Niklas, et al.
Published: (2025)
PilotBench: A Benchmark for General Aviation Agents with Safety Constraints
by: Wu, Yalun, et al.
Published: (2026)
Design Principles for the Construction of a Benchmark Evaluating Security Operation Capabilities of Multi-agent AI Systems
by: Cai, Yicheng, et al.
Published: (2026)
An Agentic Multi-Agent Architecture for Cybersecurity Risk Management
by: Gupta, Ravish, et al.
Published: (2026)
Real-Time In Silico Modeling of Postprandial Macronutrient Kinetics: A Validated Computational Engine for Nutrition Research and Digital Health
by: Calderone, Alberto
Published: (2026)
One Policy, Infinite NPCs: Persona-Traceable Shared RL Policies for Scalable Game Agents
by: Hong, Yoosung
Published: (2026)
CPEMH: An Agentic Framework for Prompt-Driven Behavior Evaluation and Assurance in Foundation-Model Systems for Mental Health Screening
by: Lorenzoni, Giuliano, et al.
Published: (2026)
Integrating Anomaly Detection into Agentic AI for Proactive Risk Management in Human Activity
by: Zorriassatine, Farbod, et al.
Published: (2026)
When Outcome Looks Right But Discipline Fails: Trace-Based Evaluation Under Hidden Competitor State
by: Zhu, Peiying, et al.
Published: (2026)
HybridVFL: Disentangled Feature Learning for Edge-Enabled Vertical Federated Multimodal Classification
by: Anoosha, Mostafa, et al.
Published: (2025)
When the Agent Is the Adversary: Architectural Requirements for Agentic AI Containment After the April 2026 Frontier Model Escape
by: Mitchell, Richard Joseph
Published: (2026)
The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games
by: Goodyear, Lyle, et al.
Published: (2025)
Difference Rewards Policy Gradients
by: Castellini, Jacopo, et al.
Published: (2020)
Collaborative On-Sensor Array Cameras
by: Sun, Jipeng, et al.
Published: (2025)
Super-additive Cooperation in Language Model Agents
by: Tonini, Filippo, et al.
Published: (2025)
Instruction-Level Weight Shaping: A Framework for Self-Improving AI Agents
by: Costa, Rimom
Published: (2025)
ILION: Deterministic Pre-Execution Safety Gates for Agentic AI Systems
by: Chitan, Florin Adrian
Published: (2026)
Session Risk Memory (SRM): Temporal Authorization for Deterministic Pre-Execution Safety Gates
by: Chitan, Florin Adrian
Published: (2026)