Saved in:
Bibliographic Details
Main Authors: de Medeiros, Rodrigo M. R., Queiroz, Francisco F.
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.08600
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912824729534464
author de Medeiros, Rodrigo M. R.
Queiroz, Francisco F.
author_facet de Medeiros, Rodrigo M. R.
Queiroz, Francisco F.
contents The Box-Cox symmetric distributions constitute a broad class of probability models for positive continuous data, offering flexibility in modeling skewness and tail behavior. Their parameterization allows a straightforward quantile-based interpretation, which is particularly useful in regression modeling. Despite their potential, only a few specific distributions within this class have been explored in regression contexts, and zero-adjusted extensions have not yet been formally addressed in the literature. This paper formalizes the class of Box-Cox symmetric regression models and introduces a new zero-adjusted extension suitable for modeling data with a non-negligible proportion of observations equal to zero. We discuss maximum likelihood estimation, assess finite-sample performance through simulations, and develop diagnostic tools including residual analysis, local influence measures, and goodness-of-fit statistics. An empirical application on basic education expenditure illustrates the models' ability to capture complex patterns in zero-inflated and highly skewed nonnegative data. To support practical use, we developed the new BCSreg R package, which implements all proposed methods.
format Preprint
id arxiv_https___arxiv_org_abs_2601_08600
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Flexible modeling of nonnegative continuous data: Box-Cox symmetric regression and its zero-adjusted extension
de Medeiros, Rodrigo M. R.
Queiroz, Francisco F.
Methodology
The Box-Cox symmetric distributions constitute a broad class of probability models for positive continuous data, offering flexibility in modeling skewness and tail behavior. Their parameterization allows a straightforward quantile-based interpretation, which is particularly useful in regression modeling. Despite their potential, only a few specific distributions within this class have been explored in regression contexts, and zero-adjusted extensions have not yet been formally addressed in the literature. This paper formalizes the class of Box-Cox symmetric regression models and introduces a new zero-adjusted extension suitable for modeling data with a non-negligible proportion of observations equal to zero. We discuss maximum likelihood estimation, assess finite-sample performance through simulations, and develop diagnostic tools including residual analysis, local influence measures, and goodness-of-fit statistics. An empirical application on basic education expenditure illustrates the models' ability to capture complex patterns in zero-inflated and highly skewed nonnegative data. To support practical use, we developed the new BCSreg R package, which implements all proposed methods.
title Flexible modeling of nonnegative continuous data: Box-Cox symmetric regression and its zero-adjusted extension
topic Methodology
url https://arxiv.org/abs/2601.08600