Saved in:
Bibliographic Details
Main Authors: Trantas, Athanasios, Mensio, Martino, Stasinos, Stylianos, Gribincea, Sebastian, Khan, Taimur, Podareanu, Damian, van der Veen, Aliene
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.09080
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911301034311680
author Trantas, Athanasios
Mensio, Martino
Stasinos, Stylianos
Gribincea, Sebastian
Khan, Taimur
Podareanu, Damian
van der Veen, Aliene
author_facet Trantas, Athanasios
Mensio, Martino
Stasinos, Stylianos
Gribincea, Sebastian
Khan, Taimur
Podareanu, Damian
van der Veen, Aliene
contents Multimodal Foundation Models (FMs) offer a path to learn general-purpose representations from heterogeneous ecological data, easily transferable to downstream tasks. However, practical biodiversity modelling remains fragmented; separate pipelines and models are built for each dataset and objective, which limits reuse across regions and taxa. In response, we present BioAnalyst, to our knowledge the first multimodal Foundation Model tailored to biodiversity analysis and conservation planning in Europe at $0.25^{\circ}$ spatial resolution targeting regional to national-scale applications. BioAnalyst employs a transformer-based architecture, pre-trained on extensive multimodal datasets that align species occurrence records with remote sensing indicators, climate and environmental variables. Post pre-training, the model is adapted via lightweight roll-out fine-tuning to a range of downstream tasks, including joint species distribution modelling, biodiversity dynamics and population trend forecasting. The model is evaluated on two representative downstream use cases: (i) joint species distribution modelling and with 500 vascular plant species (ii) monthly climate linear probing with temperature and precipitation data. Our findings show that BioAnalyst can provide a strong baseline both for biotic and abiotic tasks, acting as a macroecological simulator with a yearly forecasting horizon and monthly resolution, offering the first application of this type of modelling in the biodiversity domain. We have open-sourced the model weights, training and fine-tuning pipelines to advance AI-driven ecological research.
format Preprint
id arxiv_https___arxiv_org_abs_2507_09080
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle BioAnalyst: A Foundation Model for Biodiversity
Trantas, Athanasios
Mensio, Martino
Stasinos, Stylianos
Gribincea, Sebastian
Khan, Taimur
Podareanu, Damian
van der Veen, Aliene
Artificial Intelligence
Multimodal Foundation Models (FMs) offer a path to learn general-purpose representations from heterogeneous ecological data, easily transferable to downstream tasks. However, practical biodiversity modelling remains fragmented; separate pipelines and models are built for each dataset and objective, which limits reuse across regions and taxa. In response, we present BioAnalyst, to our knowledge the first multimodal Foundation Model tailored to biodiversity analysis and conservation planning in Europe at $0.25^{\circ}$ spatial resolution targeting regional to national-scale applications. BioAnalyst employs a transformer-based architecture, pre-trained on extensive multimodal datasets that align species occurrence records with remote sensing indicators, climate and environmental variables. Post pre-training, the model is adapted via lightweight roll-out fine-tuning to a range of downstream tasks, including joint species distribution modelling, biodiversity dynamics and population trend forecasting. The model is evaluated on two representative downstream use cases: (i) joint species distribution modelling and with 500 vascular plant species (ii) monthly climate linear probing with temperature and precipitation data. Our findings show that BioAnalyst can provide a strong baseline both for biotic and abiotic tasks, acting as a macroecological simulator with a yearly forecasting horizon and monthly resolution, offering the first application of this type of modelling in the biodiversity domain. We have open-sourced the model weights, training and fine-tuning pipelines to advance AI-driven ecological research.
title BioAnalyst: A Foundation Model for Biodiversity
topic Artificial Intelligence
url https://arxiv.org/abs/2507.09080