Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hamilton, Brayden, Cashmore, Tim, Driscoll, Peter, Gee, Trevor, Williams, Henry
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2601.20196
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912854154674176
author	Hamilton, Brayden Cashmore, Tim Driscoll, Peter Gee, Trevor Williams, Henry
author_facet	Hamilton, Brayden Cashmore, Tim Driscoll, Peter Gee, Trevor Williams, Henry
contents	Marine biofouling on vessel hulls poses major ecological, economic, and biosecurity risks. Traditional survey methods rely on diver inspections, which are hazardous and limited in scalability. This work investigates automated classification of biofouling severity on the Level of Fouling (LoF) scale using both custom computer vision models and large multimodal language models (LLMs). Convolutional neural networks, transformer-based segmentation, and zero-shot LLMs were evaluated on an expert-labelled dataset from the New Zealand Ministry for Primary Industries. Computer vision models showed high accuracy at extreme LoF categories but struggled with intermediate levels due to dataset imbalance and image framing. LLMs, guided by structured prompts and retrieval, achieved competitive performance without training and provided interpretable outputs. The results demonstrate complementary strengths across approaches and suggest that hybrid methods integrating segmentation coverage with LLM reasoning offer a promising pathway toward scalable and interpretable biofouling assessment.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_20196
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Automated Marine Biofouling Assessment: Benchmarking Computer Vision and Multimodal LLMs on the Level of Fouling Scale Hamilton, Brayden Cashmore, Tim Driscoll, Peter Gee, Trevor Williams, Henry Computer Vision and Pattern Recognition Marine biofouling on vessel hulls poses major ecological, economic, and biosecurity risks. Traditional survey methods rely on diver inspections, which are hazardous and limited in scalability. This work investigates automated classification of biofouling severity on the Level of Fouling (LoF) scale using both custom computer vision models and large multimodal language models (LLMs). Convolutional neural networks, transformer-based segmentation, and zero-shot LLMs were evaluated on an expert-labelled dataset from the New Zealand Ministry for Primary Industries. Computer vision models showed high accuracy at extreme LoF categories but struggled with intermediate levels due to dataset imbalance and image framing. LLMs, guided by structured prompts and retrieval, achieved competitive performance without training and provided interpretable outputs. The results demonstrate complementary strengths across approaches and suggest that hybrid methods integrating segmentation coverage with LLM reasoning offer a promising pathway toward scalable and interpretable biofouling assessment.
title	Automated Marine Biofouling Assessment: Benchmarking Computer Vision and Multimodal LLMs on the Level of Fouling Scale
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2601.20196

Similar Items