Saved in:
Bibliographic Details
Main Author: Sabrin, Md. Sanaullah Chowdhury Lameya
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.12652
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908429162905600
author Sabrin, Md. Sanaullah Chowdhury Lameya
author_facet Sabrin, Md. Sanaullah Chowdhury Lameya
contents This paper introduces AdaptoVision, a novel convolutional neural network (CNN) architecture designed to efficiently balance computational complexity and classification accuracy. By leveraging enhanced residual units, depth-wise separable convolutions, and hierarchical skip connections, AdaptoVision significantly reduces parameter count and computational requirements while preserving competitive performance across various benchmark and medical image datasets. Extensive experimentation demonstrates that AdaptoVision achieves state-of-the-art on BreakHis dataset and comparable accuracy levels, notably 95.3\% on CIFAR-10 and 85.77\% on CIFAR-100, without relying on any pretrained weights. The model's streamlined architecture and strategic simplifications promote effective feature extraction and robust generalization, making it particularly suitable for deployment in real-time and resource-constrained environments.
format Preprint
id arxiv_https___arxiv_org_abs_2504_12652
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle AdaptoVision: A Multi-Resolution Image Recognition Model for Robust and Scalable Classification
Sabrin, Md. Sanaullah Chowdhury Lameya
Computer Vision and Pattern Recognition
This paper introduces AdaptoVision, a novel convolutional neural network (CNN) architecture designed to efficiently balance computational complexity and classification accuracy. By leveraging enhanced residual units, depth-wise separable convolutions, and hierarchical skip connections, AdaptoVision significantly reduces parameter count and computational requirements while preserving competitive performance across various benchmark and medical image datasets. Extensive experimentation demonstrates that AdaptoVision achieves state-of-the-art on BreakHis dataset and comparable accuracy levels, notably 95.3\% on CIFAR-10 and 85.77\% on CIFAR-100, without relying on any pretrained weights. The model's streamlined architecture and strategic simplifications promote effective feature extraction and robust generalization, making it particularly suitable for deployment in real-time and resource-constrained environments.
title AdaptoVision: A Multi-Resolution Image Recognition Model for Robust and Scalable Classification
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2504.12652