Saved in:
Bibliographic Details
Main Authors: Dame, Kenenisa Tadesse, Belloni, Pietro, Moretti, Ugo, Scapini, Fabio, Tuccori, Marco, Brazzale, Alessandra R.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.11452
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917141801861120
author Dame, Kenenisa Tadesse
Belloni, Pietro
Moretti, Ugo
Scapini, Fabio
Tuccori, Marco
Brazzale, Alessandra R.
author_facet Dame, Kenenisa Tadesse
Belloni, Pietro
Moretti, Ugo
Scapini, Fabio
Tuccori, Marco
Brazzale, Alessandra R.
contents Post--marketing pharmacovigilance is essential for identifying adverse drug reactions (ADRs) that elude detection during pre--marketing clinical trials. This study explores a novel approach that integrates an adverse event (AE) ontology into a zero--inflated negative binomial model to improve ADR detection. By accounting for the biological similarities among correlated AEs and addressing the excess of zero counts, this method more effectively disentangles AE associations. Statistical significance is evaluated using a permutation--based maximum statistic that preserves AE correlations within individual reports. Simulations and an application to real data from the Veneto drug safety database demonstrate that the ontology--based model consistently outperforms classical models such as the Gamma--Poisson Shrinker (GPS). For post--selection inference, we furthermore explore a data thinning technique for convolution--closed families, enabling the creation of independent training and validation datasets while retaining all drug--AE pairs. This approach is compared with conventional random train/test splitting, which may leave some drugs or AEs absent from one subset, and stratified splitting, which requires expanding aggregated counts into individual instances. The data--thinning technique and stratified splitting yield very similar results, with stratified splitting showing a slight benefit, and both clearly outperform random splitting in ensuring reliable and consistent model evaluation.
format Preprint
id arxiv_https___arxiv_org_abs_2512_11452
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Advances in Ontology--Based Mining of Adverse Drug Reactions
Dame, Kenenisa Tadesse
Belloni, Pietro
Moretti, Ugo
Scapini, Fabio
Tuccori, Marco
Brazzale, Alessandra R.
Methodology
Post--marketing pharmacovigilance is essential for identifying adverse drug reactions (ADRs) that elude detection during pre--marketing clinical trials. This study explores a novel approach that integrates an adverse event (AE) ontology into a zero--inflated negative binomial model to improve ADR detection. By accounting for the biological similarities among correlated AEs and addressing the excess of zero counts, this method more effectively disentangles AE associations. Statistical significance is evaluated using a permutation--based maximum statistic that preserves AE correlations within individual reports. Simulations and an application to real data from the Veneto drug safety database demonstrate that the ontology--based model consistently outperforms classical models such as the Gamma--Poisson Shrinker (GPS). For post--selection inference, we furthermore explore a data thinning technique for convolution--closed families, enabling the creation of independent training and validation datasets while retaining all drug--AE pairs. This approach is compared with conventional random train/test splitting, which may leave some drugs or AEs absent from one subset, and stratified splitting, which requires expanding aggregated counts into individual instances. The data--thinning technique and stratified splitting yield very similar results, with stratified splitting showing a slight benefit, and both clearly outperform random splitting in ensuring reliable and consistent model evaluation.
title Advances in Ontology--Based Mining of Adverse Drug Reactions
topic Methodology
url https://arxiv.org/abs/2512.11452