Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Arshad, Muhammad Arbab, Roy, Tirtho, Shen, Yanben, Elango, Dinakaran, Chiranjeevi, Shivani, Singh, Asheesh K., Ganapathysubramanian, Baskar, Hegde, Chinmay, Singh, Arti, Sarkar, Soumik
Format:	Preprint
Published:	2026
Subjects:	Multiagent Systems
Online Access:	https://arxiv.org/abs/2605.09768
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913110047064064
author	Arshad, Muhammad Arbab Roy, Tirtho Shen, Yanben Elango, Dinakaran Chiranjeevi, Shivani Singh, Asheesh K. Ganapathysubramanian, Baskar Hegde, Chinmay Singh, Arti Sarkar, Soumik
author_facet	Arshad, Muhammad Arbab Roy, Tirtho Shen, Yanben Elango, Dinakaran Chiranjeevi, Shivani Singh, Asheesh K. Ganapathysubramanian, Baskar Hegde, Chinmay Singh, Arti Sarkar, Soumik
contents	Plant disease diagnosis is critical for food security, yet training disease-recognition models that generalize across crops, pathogens, and field conditions remains challenging because labeled disease images are far less abundant and standardized than data for other biotic stresses such as insects or weeds. Frontier vision-language models offer new opportunities through improved visual reasoning, but they still struggle with fine-grained disease identification due to the lack of structured, crop-specific symptom knowledge. To address this gap, we curate the largest plant disease image--symptom dataset to date, covering 335 crops, 1{,}251 disease classes, and approximately 839K images, designed to support training-free, agentic disease prediction. A scalable automated pipeline generates source-grounded symptom descriptions in which each claim is linked to a verbatim web quote; domain experts validate sampled crops and reconcile disease-name variants across sources. As a baseline, we introduce an autonomous visual reasoning agent that identifies anatomical context, narrows candidate diseases using symptom knowledge, sequentially compares reference images, and produces a fully explainable reasoning trace. Incorporating symptom knowledge improves accuracy by 16.2 percentage points on average at the full reference budget, with consistent gains across all four evaluation crops. Because the framework only requires crop-specific reference images and symptom knowledge, it can be extended to new crops without retraining, while the agentic baseline can directly benefit from future improvements in foundation model capabilities. Dataset and code are available at:https://sage-dataset.github.io/.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_09768
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	SAGE: Scalable Agentic Grounded Evaluation for Crop Disease Diagnosis Arshad, Muhammad Arbab Roy, Tirtho Shen, Yanben Elango, Dinakaran Chiranjeevi, Shivani Singh, Asheesh K. Ganapathysubramanian, Baskar Hegde, Chinmay Singh, Arti Sarkar, Soumik Multiagent Systems Plant disease diagnosis is critical for food security, yet training disease-recognition models that generalize across crops, pathogens, and field conditions remains challenging because labeled disease images are far less abundant and standardized than data for other biotic stresses such as insects or weeds. Frontier vision-language models offer new opportunities through improved visual reasoning, but they still struggle with fine-grained disease identification due to the lack of structured, crop-specific symptom knowledge. To address this gap, we curate the largest plant disease image--symptom dataset to date, covering 335 crops, 1{,}251 disease classes, and approximately 839K images, designed to support training-free, agentic disease prediction. A scalable automated pipeline generates source-grounded symptom descriptions in which each claim is linked to a verbatim web quote; domain experts validate sampled crops and reconcile disease-name variants across sources. As a baseline, we introduce an autonomous visual reasoning agent that identifies anatomical context, narrows candidate diseases using symptom knowledge, sequentially compares reference images, and produces a fully explainable reasoning trace. Incorporating symptom knowledge improves accuracy by 16.2 percentage points on average at the full reference budget, with consistent gains across all four evaluation crops. Because the framework only requires crop-specific reference images and symptom knowledge, it can be extended to new crops without retraining, while the agentic baseline can directly benefit from future improvements in foundation model capabilities. Dataset and code are available at:https://sage-dataset.github.io/.
title	SAGE: Scalable Agentic Grounded Evaluation for Crop Disease Diagnosis
topic	Multiagent Systems
url	https://arxiv.org/abs/2605.09768

Similar Items