Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Mazumder, Anirudh, Liu, Jianguo
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2409.18257
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866909562591772672
author Mazumder, Anirudh
Liu, Jianguo
author_facet Mazumder, Anirudh
Liu, Jianguo
contents Lung diseases have become a prevalent problem throughout the United States, affecting over 34 million people. Accurate and timely diagnosis of the different types of lung diseases is critical, and Artificial Intelligence (AI) methods could speed up these processes. A dual-stage vision transformer is built throughout this research by integrating a Vision Transformer (ViT) and a Swin Transformer to classify 14 different lung diseases from X-ray scans of patients with these diseases. The proposed model achieved an accuracy of 92.06% on a label-level when making predictions on an unseen testing subset of the dataset after data preprocessing and training the neural network. The model showed promise for accurately classifying lung diseases and diagnosing patients who suffer from these harmful diseases.
format Preprint
id arxiv_https___arxiv_org_abs_2409_18257
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Developing a Dual-Stage Vision Transformer Model for Lung Disease Classification
Mazumder, Anirudh
Liu, Jianguo
Image and Video Processing
Computer Vision and Pattern Recognition
Lung diseases have become a prevalent problem throughout the United States, affecting over 34 million people. Accurate and timely diagnosis of the different types of lung diseases is critical, and Artificial Intelligence (AI) methods could speed up these processes. A dual-stage vision transformer is built throughout this research by integrating a Vision Transformer (ViT) and a Swin Transformer to classify 14 different lung diseases from X-ray scans of patients with these diseases. The proposed model achieved an accuracy of 92.06% on a label-level when making predictions on an unseen testing subset of the dataset after data preprocessing and training the neural network. The model showed promise for accurately classifying lung diseases and diagnosing patients who suffer from these harmful diseases.
title Developing a Dual-Stage Vision Transformer Model for Lung Disease Classification
topic Image and Video Processing
Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2409.18257