Saved in:
Bibliographic Details
Main Authors: Filgueiras, António, Marques, Eduardo R. B., Lopes, Luís M. B., Marques, Miguel, Silva, Hugo
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.12072
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913007998599168
author Filgueiras, António
Marques, Eduardo R. B.
Lopes, Luís M. B.
Marques, Miguel
Silva, Hugo
author_facet Filgueiras, António
Marques, Eduardo R. B.
Lopes, Luís M. B.
Marques, Miguel
Silva, Hugo
contents Machine-learning techniques, especially deep convolutional neural networks, are pivotal for image-based identification of biological species in many Citizen Science platforms. In this paper, we describe the construction of a dataset for the Portuguese native flora based on publicly available research-grade datasets, and the derivation of a high-accuracy model from it using off-the-shelf deep convolutional neural networks. We anchored the dataset in high-quality data provided by Sociedade Portuguesa de Botânica and added further sampled data from research-grade datasets available from GBIF. We find that with a careful dataset design, off-the-shelf machine-learning cloud services such as Google's AutoML Vision produce accurate models, with results comparable to those of Pl@ntNet, a state-of-the-art citizen science platform. The best model we derived, dubbed Floralens, has been integrated into the public website of Project Biolens, where we gather models for other taxa as well. The dataset used to train the model is also publicly available on Zenodo.
format Preprint
id arxiv_https___arxiv_org_abs_2403_12072
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Floralens: a Deep Learning Model for the Portuguese Native Flora
Filgueiras, António
Marques, Eduardo R. B.
Lopes, Luís M. B.
Marques, Miguel
Silva, Hugo
Computer Vision and Pattern Recognition
Machine Learning
Machine-learning techniques, especially deep convolutional neural networks, are pivotal for image-based identification of biological species in many Citizen Science platforms. In this paper, we describe the construction of a dataset for the Portuguese native flora based on publicly available research-grade datasets, and the derivation of a high-accuracy model from it using off-the-shelf deep convolutional neural networks. We anchored the dataset in high-quality data provided by Sociedade Portuguesa de Botânica and added further sampled data from research-grade datasets available from GBIF. We find that with a careful dataset design, off-the-shelf machine-learning cloud services such as Google's AutoML Vision produce accurate models, with results comparable to those of Pl@ntNet, a state-of-the-art citizen science platform. The best model we derived, dubbed Floralens, has been integrated into the public website of Project Biolens, where we gather models for other taxa as well. The dataset used to train the model is also publicly available on Zenodo.
title Floralens: a Deep Learning Model for the Portuguese Native Flora
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2403.12072