Salvato in:
Dettagli Bibliografici
Autori principali: Guan, Zihan, Zhao, Zhiyuan, Tian, Fengwei, Nguyen, Dung, Bhattacharjee, Payel, Tandon, Ravi, Prakash, B. Aditya, Vullikanti, Anil
Natura: Preprint
Pubblicazione: 2025
Soggetti:
Accesso online:https://arxiv.org/abs/2506.22342
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866912971595186176
author Guan, Zihan
Zhao, Zhiyuan
Tian, Fengwei
Nguyen, Dung
Bhattacharjee, Payel
Tandon, Ravi
Prakash, B. Aditya
Vullikanti, Anil
author_facet Guan, Zihan
Zhao, Zhiyuan
Tian, Fengwei
Nguyen, Dung
Bhattacharjee, Payel
Tandon, Ravi
Prakash, B. Aditya
Vullikanti, Anil
contents Epidemic analyses increasingly rely on heterogeneous datasets, many of which are sensitive and require strong privacy protection. Although differential privacy (DP) has become a standard in machine learning and data sharing, its adoption in epidemiological modeling remains limited. In this work, we introduce DPEpiNN, a unified framework that integrates deep neural networks with a mechanistic SEIRM-based metapopulation model under formal DP guarantees. DPEpiNN supports multiple epidemic tasks (including multi-step forecasting, nowcasting, effective reproduction number $(R_t)$ estimation, and intervention analysis) within a single differentiable pipeline. The framework jointly learns epidemic parameters from heterogeneous public and sensitive datasets, while ensuring privacy via input perturbation mechanisms. We evaluate DPEpiNN using COVID-19 data from three regions. Results show that incorporating sensitive datasets substantially improves predictive performance even under strong privacy constraints. Compared with a deep learning baseline, DPEpiNN achieves higher accuracy in forecasting and nowcasting while producing reliable estimates of $R_t$. Furthermore, the learned epidemic transmission models remain inherently private due to the post-processing property of differential privacy, enabling downstream policy analyses such as simulation of social distancing interventions. Our work demonstrates that interpretability (through mechanistic modeling), predictive accuracy (through neural integration), and rigorous privacy guarantees can be jointly achieved in modern epidemic modeling.
format Preprint
id arxiv_https___arxiv_org_abs_2506_22342
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Improving Epidemic Analyses with Privacy-Preserving Integration of Sensitive Data
Guan, Zihan
Zhao, Zhiyuan
Tian, Fengwei
Nguyen, Dung
Bhattacharjee, Payel
Tandon, Ravi
Prakash, B. Aditya
Vullikanti, Anil
Machine Learning
Artificial Intelligence
Epidemic analyses increasingly rely on heterogeneous datasets, many of which are sensitive and require strong privacy protection. Although differential privacy (DP) has become a standard in machine learning and data sharing, its adoption in epidemiological modeling remains limited. In this work, we introduce DPEpiNN, a unified framework that integrates deep neural networks with a mechanistic SEIRM-based metapopulation model under formal DP guarantees. DPEpiNN supports multiple epidemic tasks (including multi-step forecasting, nowcasting, effective reproduction number $(R_t)$ estimation, and intervention analysis) within a single differentiable pipeline. The framework jointly learns epidemic parameters from heterogeneous public and sensitive datasets, while ensuring privacy via input perturbation mechanisms. We evaluate DPEpiNN using COVID-19 data from three regions. Results show that incorporating sensitive datasets substantially improves predictive performance even under strong privacy constraints. Compared with a deep learning baseline, DPEpiNN achieves higher accuracy in forecasting and nowcasting while producing reliable estimates of $R_t$. Furthermore, the learned epidemic transmission models remain inherently private due to the post-processing property of differential privacy, enabling downstream policy analyses such as simulation of social distancing interventions. Our work demonstrates that interpretability (through mechanistic modeling), predictive accuracy (through neural integration), and rigorous privacy guarantees can be jointly achieved in modern epidemic modeling.
title Improving Epidemic Analyses with Privacy-Preserving Integration of Sensitive Data
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2506.22342