Saved in:
Bibliographic Details
Main Authors: Hamilton, Brayden, Cashmore, Tim, Driscoll, Peter, Gee, Trevor, Williams, Henry
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.20196
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912854154674176
author Hamilton, Brayden
Cashmore, Tim
Driscoll, Peter
Gee, Trevor
Williams, Henry
author_facet Hamilton, Brayden
Cashmore, Tim
Driscoll, Peter
Gee, Trevor
Williams, Henry
contents Marine biofouling on vessel hulls poses major ecological, economic, and biosecurity risks. Traditional survey methods rely on diver inspections, which are hazardous and limited in scalability. This work investigates automated classification of biofouling severity on the Level of Fouling (LoF) scale using both custom computer vision models and large multimodal language models (LLMs). Convolutional neural networks, transformer-based segmentation, and zero-shot LLMs were evaluated on an expert-labelled dataset from the New Zealand Ministry for Primary Industries. Computer vision models showed high accuracy at extreme LoF categories but struggled with intermediate levels due to dataset imbalance and image framing. LLMs, guided by structured prompts and retrieval, achieved competitive performance without training and provided interpretable outputs. The results demonstrate complementary strengths across approaches and suggest that hybrid methods integrating segmentation coverage with LLM reasoning offer a promising pathway toward scalable and interpretable biofouling assessment.
format Preprint
id arxiv_https___arxiv_org_abs_2601_20196
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Automated Marine Biofouling Assessment: Benchmarking Computer Vision and Multimodal LLMs on the Level of Fouling Scale
Hamilton, Brayden
Cashmore, Tim
Driscoll, Peter
Gee, Trevor
Williams, Henry
Computer Vision and Pattern Recognition
Marine biofouling on vessel hulls poses major ecological, economic, and biosecurity risks. Traditional survey methods rely on diver inspections, which are hazardous and limited in scalability. This work investigates automated classification of biofouling severity on the Level of Fouling (LoF) scale using both custom computer vision models and large multimodal language models (LLMs). Convolutional neural networks, transformer-based segmentation, and zero-shot LLMs were evaluated on an expert-labelled dataset from the New Zealand Ministry for Primary Industries. Computer vision models showed high accuracy at extreme LoF categories but struggled with intermediate levels due to dataset imbalance and image framing. LLMs, guided by structured prompts and retrieval, achieved competitive performance without training and provided interpretable outputs. The results demonstrate complementary strengths across approaches and suggest that hybrid methods integrating segmentation coverage with LLM reasoning offer a promising pathway toward scalable and interpretable biofouling assessment.
title Automated Marine Biofouling Assessment: Benchmarking Computer Vision and Multimodal LLMs on the Level of Fouling Scale
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2601.20196