Saved in:
Bibliographic Details
Main Authors: Braga, Douglas Costa, Dantas, Daniel Oliveira
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.01026
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917182736171008
author Braga, Douglas Costa
Dantas, Daniel Oliveira
author_facet Braga, Douglas Costa
Dantas, Daniel Oliveira
contents We present a reproducible deep learning pipeline for leukemic cell classification, focusing on system architecture, experimental robustness, and software design choices for medical image analysis. Acute lymphoblastic leukemia (ALL) is the most common childhood cancer, requiring expert microscopic diagnosis that suffers from inter-observer variability and time constraints. The proposed system integrates an attention-based convolutional neural network combining EfficientNetV2-B3 with Squeeze-and-Excitation mechanisms for automated ALL cell classification. Our approach employs comprehensive data augmentation, focal loss for class imbalance, and patient-wise data splitting to ensure robust and reproducible evaluation. On the C-NMC 2019 dataset (12,528 original images from 62 patients), the system achieves a 97.89% F1-score and 97.89% accuracy on the test set, with statistical validation through 100-iteration Monte Carlo experiments confirming significant improvements (p < 0.001) over baseline methods. The proposed pipeline outperforms existing approaches by up to 4.67% while using 89% fewer parameters than VGG16 (15.2M vs. 138M). The attention mechanism provides interpretable visualizations of diagnostically relevant cellular features, demonstrating that modern attention-based architectures can improve leukemic cell classification while maintaining computational efficiency suitable for clinical deployment.
format Preprint
id arxiv_https___arxiv_org_abs_2601_01026
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Enhanced Leukemic Cell Classification Using Attention-Based CNN and Data Augmentation
Braga, Douglas Costa
Dantas, Daniel Oliveira
Computer Vision and Pattern Recognition
Artificial Intelligence
Machine Learning
Software Engineering
68T07, 92C55
I.4.9; I.5.4; J.3
We present a reproducible deep learning pipeline for leukemic cell classification, focusing on system architecture, experimental robustness, and software design choices for medical image analysis. Acute lymphoblastic leukemia (ALL) is the most common childhood cancer, requiring expert microscopic diagnosis that suffers from inter-observer variability and time constraints. The proposed system integrates an attention-based convolutional neural network combining EfficientNetV2-B3 with Squeeze-and-Excitation mechanisms for automated ALL cell classification. Our approach employs comprehensive data augmentation, focal loss for class imbalance, and patient-wise data splitting to ensure robust and reproducible evaluation. On the C-NMC 2019 dataset (12,528 original images from 62 patients), the system achieves a 97.89% F1-score and 97.89% accuracy on the test set, with statistical validation through 100-iteration Monte Carlo experiments confirming significant improvements (p < 0.001) over baseline methods. The proposed pipeline outperforms existing approaches by up to 4.67% while using 89% fewer parameters than VGG16 (15.2M vs. 138M). The attention mechanism provides interpretable visualizations of diagnostically relevant cellular features, demonstrating that modern attention-based architectures can improve leukemic cell classification while maintaining computational efficiency suitable for clinical deployment.
title Enhanced Leukemic Cell Classification Using Attention-Based CNN and Data Augmentation
topic Computer Vision and Pattern Recognition
Artificial Intelligence
Machine Learning
Software Engineering
68T07, 92C55
I.4.9; I.5.4; J.3
url https://arxiv.org/abs/2601.01026