Saved in:
Bibliographic Details
Main Authors: Martínez-Minaya, Joaquín, Zumeta-Olaskoaga, Lore, Lee, Dae-Jin
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.07708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912532330971136
author Martínez-Minaya, Joaquín
Zumeta-Olaskoaga, Lore
Lee, Dae-Jin
author_facet Martínez-Minaya, Joaquín
Zumeta-Olaskoaga, Lore
Lee, Dae-Jin
contents Compositional data (CoDa) plays an important role in many fields such as ecology, geology, or biology. The most widely used modeling approaches are based on the Dirichlet and the logistic-normal formulation under Aitchison geometry. Recent developments in the mathematical field on the simplex geometry allow to express the regression model in terms of coordinates and estimate its coefficients. Once the model is projected in the real space, we can employ a multivariate Gaussian regression to deal with it. However, most existing methods focus on linear models, and there is a lack of flexible alternatives such as additive or spatial models, especially within a Bayesian framework and with practical implementation details. In this work, we present a geoadditive regression model for CoDa from a Bayesian perspective using the brms package in R. The model applies the isometric log-ratio (ilr) transformation and penalized splines to incorporate nonlinear effects. We also propose two new Bayesian goodness-of-fit measures for CoDa regression: BR-CoDa-$R^2$ and BM-CoDa-$R^2$, extending the Bayesian $R^2$ to the compositional setting. These measures, alongside WAIC, support model selection and evaluation. The methodology is validated through simulation studies and applied to predict soil texture composition in the Basque Country. Results demonstrate good performance, interpretable spatial patterns, and reliable quantification of explained variability in compositional outcomes.
format Preprint
id arxiv_https___arxiv_org_abs_2508_07708
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Soil Texture Prediction with Bayesian Generalized Additive Models for Spatial Compositional Data
Martínez-Minaya, Joaquín
Zumeta-Olaskoaga, Lore
Lee, Dae-Jin
Methodology
Applications
Compositional data (CoDa) plays an important role in many fields such as ecology, geology, or biology. The most widely used modeling approaches are based on the Dirichlet and the logistic-normal formulation under Aitchison geometry. Recent developments in the mathematical field on the simplex geometry allow to express the regression model in terms of coordinates and estimate its coefficients. Once the model is projected in the real space, we can employ a multivariate Gaussian regression to deal with it. However, most existing methods focus on linear models, and there is a lack of flexible alternatives such as additive or spatial models, especially within a Bayesian framework and with practical implementation details. In this work, we present a geoadditive regression model for CoDa from a Bayesian perspective using the brms package in R. The model applies the isometric log-ratio (ilr) transformation and penalized splines to incorporate nonlinear effects. We also propose two new Bayesian goodness-of-fit measures for CoDa regression: BR-CoDa-$R^2$ and BM-CoDa-$R^2$, extending the Bayesian $R^2$ to the compositional setting. These measures, alongside WAIC, support model selection and evaluation. The methodology is validated through simulation studies and applied to predict soil texture composition in the Basque Country. Results demonstrate good performance, interpretable spatial patterns, and reliable quantification of explained variability in compositional outcomes.
title Soil Texture Prediction with Bayesian Generalized Additive Models for Spatial Compositional Data
topic Methodology
Applications
url https://arxiv.org/abs/2508.07708