Saved in:
Bibliographic Details
Main Authors: Tan, Da, Beck, Michael, Bidinosti, Christopher P., Gulden, Robert H., Henry, Christopher J.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.19632
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914214845612032
author Tan, Da
Beck, Michael
Bidinosti, Christopher P.
Gulden, Robert H.
Henry, Christopher J.
author_facet Tan, Da
Beck, Michael
Bidinosti, Christopher P.
Gulden, Robert H.
Henry, Christopher J.
contents The success of agricultural artificial intelligence depends heavily on large, diverse, and high-quality plant image datasets, yet collecting such data in real field conditions is costly, labor intensive, and seasonally constrained. This paper investigates diffusion-based generative modeling to address these challenges through plant image synthesis, indoor-to-outdoor translation, and expert preference aligned fine tuning. First, a Stable Diffusion model is fine tuned on captioned indoor and outdoor plant imagery to generate realistic, text conditioned images of canola and soybean. Evaluation using Inception Score, Frechet Inception Distance, and downstream phenotype classification shows that synthetic images effectively augment training data and improve accuracy. Second, we bridge the gap between high resolution indoor datasets and limited outdoor imagery using DreamBooth-based text inversion and image guided diffusion, generating translated images that enhance weed detection and classification with YOLOv8. Finally, a preference guided fine tuning framework trains a reward model on expert scores and applies reward weighted updates to produce more stable and expert aligned outputs. Together, these components demonstrate a practical pathway toward data efficient generative pipelines for agricultural AI.
format Preprint
id arxiv_https___arxiv_org_abs_2512_19632
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment
Tan, Da
Beck, Michael
Bidinosti, Christopher P.
Gulden, Robert H.
Henry, Christopher J.
Computer Vision and Pattern Recognition
The success of agricultural artificial intelligence depends heavily on large, diverse, and high-quality plant image datasets, yet collecting such data in real field conditions is costly, labor intensive, and seasonally constrained. This paper investigates diffusion-based generative modeling to address these challenges through plant image synthesis, indoor-to-outdoor translation, and expert preference aligned fine tuning. First, a Stable Diffusion model is fine tuned on captioned indoor and outdoor plant imagery to generate realistic, text conditioned images of canola and soybean. Evaluation using Inception Score, Frechet Inception Distance, and downstream phenotype classification shows that synthetic images effectively augment training data and improve accuracy. Second, we bridge the gap between high resolution indoor datasets and limited outdoor imagery using DreamBooth-based text inversion and image guided diffusion, generating translated images that enhance weed detection and classification with YOLOv8. Finally, a preference guided fine tuning framework trains a reward model on expert scores and applies reward weighted updates to produce more stable and expert aligned outputs. Together, these components demonstrate a practical pathway toward data efficient generative pipelines for agricultural AI.
title Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2512.19632