Saved in:
| Main Authors: | Gabriel, Adrian Garret, Ahmad, Alaa Alameer, Jeyakumar, Shankar Kumar |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.22457 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
An Agentic Evaluation Architecture for Historical Bias Detection in Educational Textbooks
by: Stefan, Gabriel, et al.
Published: (2026)
by: Stefan, Gabriel, et al.
Published: (2026)
ATOD: An Evaluation Framework and Benchmark for Agentic Task-Oriented Dialogue Systems
by: Zhang, Yifei, et al.
Published: (2026)
by: Zhang, Yifei, et al.
Published: (2026)
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
by: Wong, Jeffrey T. H., et al.
Published: (2026)
by: Wong, Jeffrey T. H., et al.
Published: (2026)
Adaptive Monitoring and Real-World Evaluation of Agentic AI Systems
by: Shukla, Manish
Published: (2025)
by: Shukla, Manish
Published: (2025)
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
by: Li, Zhuofeng, et al.
Published: (2025)
by: Li, Zhuofeng, et al.
Published: (2025)
AgenTracer: Who Is Inducing Failure in the LLM Agentic Systems?
by: Zhang, Guibin, et al.
Published: (2025)
by: Zhang, Guibin, et al.
Published: (2025)
MiRAGE: A Multiagent Framework for Generating Multimodal Multihop Question-Answer Dataset for RAG Evaluation
by: Sahu, Chandan Kumar, et al.
Published: (2026)
by: Sahu, Chandan Kumar, et al.
Published: (2026)
Litmus (Re)Agent: A Benchmark and Agentic System for Predictive Evaluation of Multilingual Models
by: Mittal, Avni, et al.
Published: (2026)
by: Mittal, Avni, et al.
Published: (2026)
GNNs as Predictors of Agentic Workflow Performances
by: Zhang, Yuanshuo, et al.
Published: (2025)
by: Zhang, Yuanshuo, et al.
Published: (2025)
Exploring Modularity of Agentic Systems for Drug Discovery
by: van Weesep, Laura, et al.
Published: (2025)
by: van Weesep, Laura, et al.
Published: (2025)
Unsupervised Cycle Detection in Agentic Applications
by: George, Felix, et al.
Published: (2025)
by: George, Felix, et al.
Published: (2025)
A Novel Hierarchical Multi-Agent System for Payments Using LLMs
by: Chua, Joon Kiat, et al.
Published: (2026)
by: Chua, Joon Kiat, et al.
Published: (2026)
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning
by: Lu, Pan, et al.
Published: (2025)
by: Lu, Pan, et al.
Published: (2025)
CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution
by: Zehle, Tom
Published: (2026)
by: Zehle, Tom
Published: (2026)
Hallucination Mitigation using Agentic AI Natural Language-Based Frameworks
by: Gosmar, Diego, et al.
Published: (2025)
by: Gosmar, Diego, et al.
Published: (2025)
Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems
by: Zhang, Shaokun, et al.
Published: (2025)
by: Zhang, Shaokun, et al.
Published: (2025)
A Survey on LLM-based Multi-Agent System: Recent Advances and New Frontiers in Application
by: Chen, Shuaihang, et al.
Published: (2024)
by: Chen, Shuaihang, et al.
Published: (2024)
From Standalone LLMs to Integrated Intelligence: A Survey of Compound Al Systems
by: Chen, Jiayi, et al.
Published: (2025)
by: Chen, Jiayi, et al.
Published: (2025)
Can Agents Judge Systematic Reviews Like Humans? Evaluating SLRs with LLM-based Multi-Agent System
by: Mushtaq, Abdullah, et al.
Published: (2025)
by: Mushtaq, Abdullah, et al.
Published: (2025)
A-MapReduce: Executing Wide Search via Agentic MapReduce
by: Chen, Mingju, et al.
Published: (2026)
by: Chen, Mingju, et al.
Published: (2026)
ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in massive-agent Ecosystem
by: Wu, Fangwen, et al.
Published: (2025)
by: Wu, Fangwen, et al.
Published: (2025)
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
by: Sun, Yu, et al.
Published: (2025)
by: Sun, Yu, et al.
Published: (2025)
Agentic Medical Knowledge Graphs Enhance Medical Question Answering: Bridging the Gap Between LLMs and Evolving Medical Knowledge
by: Rezaei, Mohammad Reza, et al.
Published: (2025)
by: Rezaei, Mohammad Reza, et al.
Published: (2025)
TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON
by: Tan, John Chong Min, et al.
Published: (2024)
by: Tan, John Chong Min, et al.
Published: (2024)
Theory of Mind in Action: The Instruction Inference Task in Dynamic Human-Agent Collaboration
by: Saad, Fardin, et al.
Published: (2025)
by: Saad, Fardin, et al.
Published: (2025)
A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration
by: Liu, Zijun, et al.
Published: (2023)
by: Liu, Zijun, et al.
Published: (2023)
StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning
by: Li, Shiyang, et al.
Published: (2026)
by: Li, Shiyang, et al.
Published: (2026)
System of Agentic AI for the Discovery of Metal-Organic Frameworks
by: Inizan, Theo Jaffrelot, et al.
Published: (2025)
by: Inizan, Theo Jaffrelot, et al.
Published: (2025)
Responsible Agentic AI Requires Explicit Provenance
by: Hu, Jinwei, et al.
Published: (2026)
by: Hu, Jinwei, et al.
Published: (2026)
PAARS: Persona Aligned Agentic Retail Shoppers
by: Mansour, Saab, et al.
Published: (2025)
by: Mansour, Saab, et al.
Published: (2025)
Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning
by: Lu, Jiaxuan, et al.
Published: (2026)
by: Lu, Jiaxuan, et al.
Published: (2026)
AutoMedic: An Automated Evaluation Framework for Clinical Conversational Agents with Medical Dataset Grounding
by: Oh, Gyutaek, et al.
Published: (2025)
by: Oh, Gyutaek, et al.
Published: (2025)
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
by: Fang, Jinyuan, et al.
Published: (2025)
by: Fang, Jinyuan, et al.
Published: (2025)
Multi-agent Architecture Search via Agentic Supernet
by: Zhang, Guibin, et al.
Published: (2025)
by: Zhang, Guibin, et al.
Published: (2025)
How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
by: Ma, Zihan, et al.
Published: (2025)
by: Ma, Zihan, et al.
Published: (2025)
MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation
by: Rahmani, Mahdi, et al.
Published: (2025)
by: Rahmani, Mahdi, et al.
Published: (2025)
ReaGAN: Node-as-Agent-Reasoning Graph Agentic Network
by: Guo, Minghao, et al.
Published: (2025)
by: Guo, Minghao, et al.
Published: (2025)
Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems
by: Akshathala, Sreemaee, et al.
Published: (2025)
by: Akshathala, Sreemaee, et al.
Published: (2025)
OralAgent: Integrating Reasoning, Tools, and Knowledge for Interactive Dental Image Analysis
by: Hao, Jing, et al.
Published: (2026)
by: Hao, Jing, et al.
Published: (2026)
Integration of Large Vision Language Models for Efficient Post-disaster Damage Assessment and Reporting
by: Chen, Zhaohui, et al.
Published: (2024)
by: Chen, Zhaohui, et al.
Published: (2024)
Similar Items
-
An Agentic Evaluation Architecture for Historical Bias Detection in Educational Textbooks
by: Stefan, Gabriel, et al.
Published: (2026) -
ATOD: An Evaluation Framework and Benchmark for Agentic Task-Oriented Dialogue Systems
by: Zhang, Yifei, et al.
Published: (2026) -
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
by: Wong, Jeffrey T. H., et al.
Published: (2026) -
Adaptive Monitoring and Real-World Evaluation of Agentic AI Systems
by: Shukla, Manish
Published: (2025) -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
by: Li, Zhuofeng, et al.
Published: (2025)