Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Acharya, Anurag, Vega, Timothy, Ashraf, Rizwan A., Sharma, Anshu, Parker, Derek, Rallo, Robert
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.12607
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908774271287296
author	Acharya, Anurag Vega, Timothy Ashraf, Rizwan A. Sharma, Anshu Parker, Derek Rallo, Robert
author_facet	Acharya, Anurag Vega, Timothy Ashraf, Rizwan A. Sharma, Anshu Parker, Derek Rallo, Robert
contents	As Large Language Models (LLMs) become ubiquitous across various scientific domains, their lack of ability to perform complex tasks like running simulations or to make complex decisions limits their utility. LLM-based agents bridge this gap due to their ability to call external resources and tools and thus are now rapidly gaining popularity. However, coming up with a workflow that can balance the models, cloud providers, and external resources is very challenging, making implementing an agentic system more of a hindrance than a help. In this work, we present a domain-agnostic, model-independent workflow for an agentic framework that can act as a scientific assistant while being run entirely on cloud. Built with a supervisor agent marshaling an array of agents with individual capabilities, our framework brings together straightforward tasks like literature review and data analysis with more complex ones like simulation runs. We describe the framework here in full, including a proof-of-concept system we built to accelerate the study of Catalysts, which is highly important in the field of Chemistry and Material Science. We report the cost to operate and use this framework, including the breakdown of the cost by services use. We also evaluate our system on a custom-curated synthetic benchmark and a popular Chemistry benchmark, and also perform expert validation of the system. The results show that our system is able to route the task to the correct agent 90% of the time and successfully complete the assigned task 97.5% of the time for the synthetic tasks and 91% of the time for real-world tasks, while still achieving better or comparable accuracy to most frontier models, showing that this is a viable framework for other scientific domains to replicate.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_12607
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	A Cloud-based Multi-Agentic Workflow for Science Acharya, Anurag Vega, Timothy Ashraf, Rizwan A. Sharma, Anshu Parker, Derek Rallo, Robert Computation and Language Artificial Intelligence As Large Language Models (LLMs) become ubiquitous across various scientific domains, their lack of ability to perform complex tasks like running simulations or to make complex decisions limits their utility. LLM-based agents bridge this gap due to their ability to call external resources and tools and thus are now rapidly gaining popularity. However, coming up with a workflow that can balance the models, cloud providers, and external resources is very challenging, making implementing an agentic system more of a hindrance than a help. In this work, we present a domain-agnostic, model-independent workflow for an agentic framework that can act as a scientific assistant while being run entirely on cloud. Built with a supervisor agent marshaling an array of agents with individual capabilities, our framework brings together straightforward tasks like literature review and data analysis with more complex ones like simulation runs. We describe the framework here in full, including a proof-of-concept system we built to accelerate the study of Catalysts, which is highly important in the field of Chemistry and Material Science. We report the cost to operate and use this framework, including the breakdown of the cost by services use. We also evaluate our system on a custom-curated synthetic benchmark and a popular Chemistry benchmark, and also perform expert validation of the system. The results show that our system is able to route the task to the correct agent 90% of the time and successfully complete the assigned task 97.5% of the time for the synthetic tasks and 91% of the time for real-world tasks, while still achieving better or comparable accuracy to most frontier models, showing that this is a viable framework for other scientific domains to replicate.
title	A Cloud-based Multi-Agentic Workflow for Science
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2601.12607

Similar Items