Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Sabrin, Md. Sanaullah Chowdhury Lameya
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2504.12652
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908429162905600
author	Sabrin, Md. Sanaullah Chowdhury Lameya
author_facet	Sabrin, Md. Sanaullah Chowdhury Lameya
contents	This paper introduces AdaptoVision, a novel convolutional neural network (CNN) architecture designed to efficiently balance computational complexity and classification accuracy. By leveraging enhanced residual units, depth-wise separable convolutions, and hierarchical skip connections, AdaptoVision significantly reduces parameter count and computational requirements while preserving competitive performance across various benchmark and medical image datasets. Extensive experimentation demonstrates that AdaptoVision achieves state-of-the-art on BreakHis dataset and comparable accuracy levels, notably 95.3\% on CIFAR-10 and 85.77\% on CIFAR-100, without relying on any pretrained weights. The model's streamlined architecture and strategic simplifications promote effective feature extraction and robust generalization, making it particularly suitable for deployment in real-time and resource-constrained environments.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_12652
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	AdaptoVision: A Multi-Resolution Image Recognition Model for Robust and Scalable Classification Sabrin, Md. Sanaullah Chowdhury Lameya Computer Vision and Pattern Recognition This paper introduces AdaptoVision, a novel convolutional neural network (CNN) architecture designed to efficiently balance computational complexity and classification accuracy. By leveraging enhanced residual units, depth-wise separable convolutions, and hierarchical skip connections, AdaptoVision significantly reduces parameter count and computational requirements while preserving competitive performance across various benchmark and medical image datasets. Extensive experimentation demonstrates that AdaptoVision achieves state-of-the-art on BreakHis dataset and comparable accuracy levels, notably 95.3\% on CIFAR-10 and 85.77\% on CIFAR-100, without relying on any pretrained weights. The model's streamlined architecture and strategic simplifications promote effective feature extraction and robust generalization, making it particularly suitable for deployment in real-time and resource-constrained environments.
title	AdaptoVision: A Multi-Resolution Image Recognition Model for Robust and Scalable Classification
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2504.12652

Similar Items