Saved in:
Bibliographic Details
Main Authors: Castro-Silva, Juan A., Garcia, Maria N. Moreno, Peluffo-Ordoñez, Diego H.
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.00545
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908858493960192
author Castro-Silva, Juan A.
Garcia, Maria N. Moreno
Peluffo-Ordoñez, Diego H.
author_facet Castro-Silva, Juan A.
Garcia, Maria N. Moreno
Peluffo-Ordoñez, Diego H.
contents The current methods for diagnosing Alzheimer Disease using Magnetic Resonance Imaging (MRI) have significant limitations. Many previous studies used 2D Transformers to analyze individual brain slices independently, potentially losing critical 3D contextual information. Region of interest-based models often focus on only a few brain regions despite Alzheimer's affecting multiple areas. Additionally, most classification models rely on a single test, whereas diagnosing Alzheimer's requires a multifaceted approach integrating diverse data sources for a more accurate assessment. This study introduces a novel methodology called the Multiple Inputs and Mixed Data 3D Vision Transformer (MIMD-3DVT). This method processes consecutive slices together to capture the feature dimensions and spatial information, fuses multiple 3D ROI imaging data inputs, and integrates mixed data from demographic factors, cognitive assessments, and brain imaging. The proposed methodology was experimentally evaluated using a combined dataset that included the Alzheimer's Disease Neuroimaging Initiative (ADNI), the Australian Imaging, Biomarker, and Lifestyle Flagship Study of Ageing (AIBL), and the Open Access Series of Imaging Studies (OASIS). Our MIMD-3DVT, utilizing single or multiple ROIs, achieved an accuracy of 97.14%, outperforming the state-of-the-art methods in distinguishing between Normal Cognition and Alzheimer's Disease.
format Preprint
id arxiv_https___arxiv_org_abs_2603_00545
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Multiple Inputs and Mixwd data for Alzheimer's Disease Classification Based on 3D Vision Transformer
Castro-Silva, Juan A.
Garcia, Maria N. Moreno
Peluffo-Ordoñez, Diego H.
Computer Vision and Pattern Recognition
The current methods for diagnosing Alzheimer Disease using Magnetic Resonance Imaging (MRI) have significant limitations. Many previous studies used 2D Transformers to analyze individual brain slices independently, potentially losing critical 3D contextual information. Region of interest-based models often focus on only a few brain regions despite Alzheimer's affecting multiple areas. Additionally, most classification models rely on a single test, whereas diagnosing Alzheimer's requires a multifaceted approach integrating diverse data sources for a more accurate assessment. This study introduces a novel methodology called the Multiple Inputs and Mixed Data 3D Vision Transformer (MIMD-3DVT). This method processes consecutive slices together to capture the feature dimensions and spatial information, fuses multiple 3D ROI imaging data inputs, and integrates mixed data from demographic factors, cognitive assessments, and brain imaging. The proposed methodology was experimentally evaluated using a combined dataset that included the Alzheimer's Disease Neuroimaging Initiative (ADNI), the Australian Imaging, Biomarker, and Lifestyle Flagship Study of Ageing (AIBL), and the Open Access Series of Imaging Studies (OASIS). Our MIMD-3DVT, utilizing single or multiple ROIs, achieved an accuracy of 97.14%, outperforming the state-of-the-art methods in distinguishing between Normal Cognition and Alzheimer's Disease.
title Multiple Inputs and Mixwd data for Alzheimer's Disease Classification Based on 3D Vision Transformer
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2603.00545