Saved in:
Bibliographic Details
Main Authors: Jamshidi, Ainaz, Arif, Muhammad, Kalhoro, Sabir Ali, Gelbukh, Alexander
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2412.16207
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915074884501504
author Jamshidi, Ainaz
Arif, Muhammad
Kalhoro, Sabir Ali
Gelbukh, Alexander
author_facet Jamshidi, Ainaz
Arif, Muhammad
Kalhoro, Sabir Ali
Gelbukh, Alexander
contents The generation of high-quality medical time series data is essential for advancing healthcare diagnostics and safeguarding patient privacy. Specifically, synthesizing realistic phonocardiogram (PCG) signals offers significant potential as a cost-effective and efficient tool for cardiac disease pre-screening. Despite its potential, the synthesis of PCG signals for this specific application received limited attention in research. In this study, we employ and compare three state-of-the-art generative models from different categories - WaveNet, DoppelGANger, and DiffWave - to generate high-quality PCG data. We use data from the George B. Moody PhysioNet Challenge 2022. Our methods are evaluated using various metrics widely used in the previous literature in the domain of time series data generation, such as mean absolute error and maximum mean discrepancy. Our results demonstrate that the generated PCG data closely resembles the original datasets, indicating the effectiveness of our generative models in producing realistic synthetic PCG data. In our future work, we plan to incorporate this method into a data augmentation pipeline to synthesize abnormal PCG signals with heart murmurs, in order to address the current scarcity of abnormal data. We hope to improve the robustness and accuracy of diagnostic tools in cardiology, enhancing their effectiveness in detecting heart murmurs.
format Preprint
id arxiv_https___arxiv_org_abs_2412_16207
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Synthetic Time Series Data Generation for Healthcare Applications: A PCG Case Study
Jamshidi, Ainaz
Arif, Muhammad
Kalhoro, Sabir Ali
Gelbukh, Alexander
Machine Learning
Computational Engineering, Finance, and Science
Signal Processing
The generation of high-quality medical time series data is essential for advancing healthcare diagnostics and safeguarding patient privacy. Specifically, synthesizing realistic phonocardiogram (PCG) signals offers significant potential as a cost-effective and efficient tool for cardiac disease pre-screening. Despite its potential, the synthesis of PCG signals for this specific application received limited attention in research. In this study, we employ and compare three state-of-the-art generative models from different categories - WaveNet, DoppelGANger, and DiffWave - to generate high-quality PCG data. We use data from the George B. Moody PhysioNet Challenge 2022. Our methods are evaluated using various metrics widely used in the previous literature in the domain of time series data generation, such as mean absolute error and maximum mean discrepancy. Our results demonstrate that the generated PCG data closely resembles the original datasets, indicating the effectiveness of our generative models in producing realistic synthetic PCG data. In our future work, we plan to incorporate this method into a data augmentation pipeline to synthesize abnormal PCG signals with heart murmurs, in order to address the current scarcity of abnormal data. We hope to improve the robustness and accuracy of diagnostic tools in cardiology, enhancing their effectiveness in detecting heart murmurs.
title Synthetic Time Series Data Generation for Healthcare Applications: A PCG Case Study
topic Machine Learning
Computational Engineering, Finance, and Science
Signal Processing
url https://arxiv.org/abs/2412.16207