Saved in:
Bibliographic Details
Main Authors: Ding, Jun-En, Hsu, Chien-Chin, Liu, Feng
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2311.14902
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910575473197056
author Ding, Jun-En
Hsu, Chien-Chin
Liu, Feng
author_facet Ding, Jun-En
Hsu, Chien-Chin
Liu, Feng
contents Parkinson's Disease (PD) affects millions globally, impacting movement. Prior research utilized deep learning for PD prediction, primarily focusing on medical images, neglecting the data's underlying manifold structure. This work proposes a multimodal approach encompassing both image and non-image features, leveraging contrastive cross-view graph fusion for PD classification. We introduce a novel multimodal co-attention module, integrating embeddings from separate graph views derived from low-dimensional representations of images and clinical features. This enables more robust and structured feature extraction for improved multi-view data analysis. Additionally, a simplified contrastive loss-based fusion method is devised to enhance cross-view fusion learning. Our graph-view multimodal approach achieves an accuracy of 0.91 and an area under the receiver operating characteristic curve (AUC) of 0.93 in five-fold cross-validation. It also demonstrates superior predictive capabilities on non-image data compared to solely machine learning-based methods.
format Preprint
id arxiv_https___arxiv_org_abs_2311_14902
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Parkinson's Disease Classification Using Contrastive Graph Cross-View Learning with Multimodal Fusion of SPECT Images and Clinical Features
Ding, Jun-En
Hsu, Chien-Chin
Liu, Feng
Computer Vision and Pattern Recognition
Parkinson's Disease (PD) affects millions globally, impacting movement. Prior research utilized deep learning for PD prediction, primarily focusing on medical images, neglecting the data's underlying manifold structure. This work proposes a multimodal approach encompassing both image and non-image features, leveraging contrastive cross-view graph fusion for PD classification. We introduce a novel multimodal co-attention module, integrating embeddings from separate graph views derived from low-dimensional representations of images and clinical features. This enables more robust and structured feature extraction for improved multi-view data analysis. Additionally, a simplified contrastive loss-based fusion method is devised to enhance cross-view fusion learning. Our graph-view multimodal approach achieves an accuracy of 0.91 and an area under the receiver operating characteristic curve (AUC) of 0.93 in five-fold cross-validation. It also demonstrates superior predictive capabilities on non-image data compared to solely machine learning-based methods.
title Parkinson's Disease Classification Using Contrastive Graph Cross-View Learning with Multimodal Fusion of SPECT Images and Clinical Features
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2311.14902