Saved in:
Bibliographic Details
Main Authors: Arshad, Muhammad Arbab, Roy, Tirtho, Shen, Yanben, Elango, Dinakaran, Chiranjeevi, Shivani, Singh, Asheesh K., Ganapathysubramanian, Baskar, Hegde, Chinmay, Singh, Arti, Sarkar, Soumik
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.09768
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913110047064064
author Arshad, Muhammad Arbab
Roy, Tirtho
Shen, Yanben
Elango, Dinakaran
Chiranjeevi, Shivani
Singh, Asheesh K.
Ganapathysubramanian, Baskar
Hegde, Chinmay
Singh, Arti
Sarkar, Soumik
author_facet Arshad, Muhammad Arbab
Roy, Tirtho
Shen, Yanben
Elango, Dinakaran
Chiranjeevi, Shivani
Singh, Asheesh K.
Ganapathysubramanian, Baskar
Hegde, Chinmay
Singh, Arti
Sarkar, Soumik
contents Plant disease diagnosis is critical for food security, yet training disease-recognition models that generalize across crops, pathogens, and field conditions remains challenging because labeled disease images are far less abundant and standardized than data for other biotic stresses such as insects or weeds. Frontier vision-language models offer new opportunities through improved visual reasoning, but they still struggle with fine-grained disease identification due to the lack of structured, crop-specific symptom knowledge. To address this gap, we curate the largest plant disease image--symptom dataset to date, covering 335 crops, 1{,}251 disease classes, and approximately 839K images, designed to support training-free, agentic disease prediction. A scalable automated pipeline generates source-grounded symptom descriptions in which each claim is linked to a verbatim web quote; domain experts validate sampled crops and reconcile disease-name variants across sources. As a baseline, we introduce an autonomous visual reasoning agent that identifies anatomical context, narrows candidate diseases using symptom knowledge, sequentially compares reference images, and produces a fully explainable reasoning trace. Incorporating symptom knowledge improves accuracy by 16.2 percentage points on average at the full reference budget, with consistent gains across all four evaluation crops. Because the framework only requires crop-specific reference images and symptom knowledge, it can be extended to new crops without retraining, while the agentic baseline can directly benefit from future improvements in foundation model capabilities. Dataset and code are available at:https://sage-dataset.github.io/.
format Preprint
id arxiv_https___arxiv_org_abs_2605_09768
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle SAGE: Scalable Agentic Grounded Evaluation for Crop Disease Diagnosis
Arshad, Muhammad Arbab
Roy, Tirtho
Shen, Yanben
Elango, Dinakaran
Chiranjeevi, Shivani
Singh, Asheesh K.
Ganapathysubramanian, Baskar
Hegde, Chinmay
Singh, Arti
Sarkar, Soumik
Multiagent Systems
Plant disease diagnosis is critical for food security, yet training disease-recognition models that generalize across crops, pathogens, and field conditions remains challenging because labeled disease images are far less abundant and standardized than data for other biotic stresses such as insects or weeds. Frontier vision-language models offer new opportunities through improved visual reasoning, but they still struggle with fine-grained disease identification due to the lack of structured, crop-specific symptom knowledge. To address this gap, we curate the largest plant disease image--symptom dataset to date, covering 335 crops, 1{,}251 disease classes, and approximately 839K images, designed to support training-free, agentic disease prediction. A scalable automated pipeline generates source-grounded symptom descriptions in which each claim is linked to a verbatim web quote; domain experts validate sampled crops and reconcile disease-name variants across sources. As a baseline, we introduce an autonomous visual reasoning agent that identifies anatomical context, narrows candidate diseases using symptom knowledge, sequentially compares reference images, and produces a fully explainable reasoning trace. Incorporating symptom knowledge improves accuracy by 16.2 percentage points on average at the full reference budget, with consistent gains across all four evaluation crops. Because the framework only requires crop-specific reference images and symptom knowledge, it can be extended to new crops without retraining, while the agentic baseline can directly benefit from future improvements in foundation model capabilities. Dataset and code are available at:https://sage-dataset.github.io/.
title SAGE: Scalable Agentic Grounded Evaluation for Crop Disease Diagnosis
topic Multiagent Systems
url https://arxiv.org/abs/2605.09768