| Main Authors: | Ovi, Md Sultanul Islam, Hossain, Mainul, Biswas, Md Badsha |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.06331 |
Similar Items
Performance Characterization of Distributed Deep Learning Strategies: A Quantitative Evaluation of DDP, FSDP, and Parameter Server Architectures on GPU Clusters
by: Ovi, Md Sultanul Islam
Published: (2025)
AttMetNet: Attention-Enhanced Deep Neural Network for Methane Plume Detection in Sentinel-2 Satellite Imagery
by: Ahsan, Rakib, et al.
Published: (2025)
IKIWISI: An Interactive Visual Pattern Generator for Evaluating the Reliability of Vision-Language Models Without Ground Truth
by: Islam, Md Touhidul, et al.
Published: (2025)
An Embedded Real-time Object Alert System for Visually Impaired: A Monocular Depth Estimation based Approach through Computer Vision
by: Anjom, Jareen, et al.
Published: (2025)
Real-Time Currency Detection and Voice Feedback for Visually Impaired Individuals
by: Shreya, Saraf Anzum, et al.
Published: (2025)
A Two-Stage Multitask Vision-Language Framework for Explainable Crop Disease Visual Question Answering
by: Hossain, Md. Zahid, et al.
Published: (2026)
Explainable AI-Driven Detection of Human Monkeypox Using Deep Learning and Vision Transformers: A Comprehensive Analysis
by: Hossain, Md. Zahid, et al.
Published: (2025)
BD Currency Detection: A CNN Based Approach with Mobile App Integration
by: Jaman, Syed Jubayer, et al.
Published: (2025)
CountFormer: A Transformer Framework for Learning Visual Repetition and Structure in Class-Agnostic Object Counting
by: Hossain, Md Tanvir, et al.
Published: (2025)
A Robust Deep Learning Framework for Bangla License Plate Recognition Using YOLO and Vision-Language OCR
by: Hasin, Nayeb, et al.
Published: (2026)
Vision-Based Lane Following and Traffic Sign Recognition for Resource-Constrained Autonomous Vehicles
by: Islam, Md Tanjemul, et al.
Published: (2026)
ChatGPT in Research and Education: Exploring Benefits and Threats
by: Miah, Abu Saleh Musa, et al.
Published: (2024)
Visual Robustness Benchmark for Visual Question Answering (VQA)
by: Ishmam, Md Farhan, et al.
Published: (2024)
Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2
by: Islam, Md. Rakibul, et al.
Published: (2025)
Personalized Federated Segmentation with Shared Feature Aggregation and Boundary-Focused Calibration
by: Tashdeed, Ishmam, et al.
Published: (2025)
VisText-Mosquito: A Unified Multimodal Dataset for Visual Detection, Segmentation, and Textual Explanation on Mosquito Breeding Sites
by: Islam, Md. Adnanul, et al.
Published: (2025)
FUSE: Unifying Spectral and Semantic Cues for Robust AI-Generated Image Detection
by: Hossain, Md. Zahid, et al.
Published: (2025)
From Image to Language: A Critical Analysis of Visual Question Answering (VQA) Approaches, Challenges, and Opportunities
by: Ishmam, Md Farhan, et al.
Published: (2023)
Reliable Deep Learning for Small-Scale Classifications: Experiments on Real-World Image Datasets from Bangladesh
by: Suny, Alfe, et al.
Published: (2026)
MSRANetV2: An Explainable Deep Learning Architecture for Multi-class Classification of Colorectal Histopathological Images
by: Sarkar, Ovi, et al.
Published: (2025)
How Good LLMs Are at Answering Bangla Medical Visual Questions? Dataset and Benchmarking
by: Ahmed, Rafid, et al.
Published: (2026)
A Domain-Adapted Lightweight Ensemble for Resource-Efficient Few-Shot Plant Disease Classification
by: Islam, Anika, et al.
Published: (2025)
Involution-Infused DenseNet with Two-Step Compression for Resource-Efficient Plant Disease Classification
by: Ahmed, T., et al.
Published: (2025)
HeBA: Heterogeneous Bottleneck Adapters for Robust Vision-Language Models
by: Islam, Md Jahidul
Published: (2026)
Protecting Student Mental Health with a Context-Aware Machine Learning Framework for Stress Monitoring
by: Ovi, Md Sultanul Islam, et al.
Published: (2025)
Transformation of Biological Networks into Images via Semantic Cartography for Visual Interpretation and Scalable Deep Analysis
by: Mostafa, Sakib, et al.
Published: (2025)
Real-Time Detection and Analysis of Vehicles and Pedestrians using Deep Learning
by: Sadik, Md Nahid, et al.
Published: (2024)
A Lightweight and Explainable DenseNet-121 Framework for Grape Leaf Disease Classification
by: Haque, Md. Ehsanul, et al.
Published: (2026)
Identifying Crucial Objects in Blind and Low-Vision Individuals' Navigation
by: Islam, Md Touhidul, et al.
Published: (2024)
Co-AttenDWG: Co-Attentive Dimension-Wise Gating and Expert Fusion for Multi-Modal Offensive Content Detection
by: Hossain, Md. Mithun, et al.
Published: (2025)
In-Depth Analysis of Automated Acne Disease Recognition and Classification
by: Jeny, Afsana Ahsan, et al.
Published: (2025)
Adaptive Local Binary Pattern: A Novel Feature Descriptor for Enhanced Analysis of Kidney Abnormalities in CT Scan Images using ensemble based Machine Learning Approach
by: Hossain, Tahmim, et al.
Published: (2024)
An Image Dataset of Common Skin Diseases of Bangladesh and Benchmarking Performance with Machine Learning Models
by: Hossain, Sazzad, et al.
Published: (2026)
A Critical Analysis on Machine Learning Techniques for Video-based Human Activity Recognition of Surveillance Systems: A Review
by: Jahan, Shahriar, et al.
Published: (2024)
CAST: Channel-Aware Spatial Transfer Learning with Pseudo-Image Radar for Sign Language Recognition
by: Shujon, Md. Shakhoyat Rahman, et al.
Published: (2026)
In-Pixel Foreground and Contrast Enhancement Circuits with Customizable Mapping
by: Udoy, Md Rahatul Islam, et al.
Published: (2024)
PhishGuard: A Multi-Layered Ensemble Model for Optimal Phishing Website Detection
by: Ovi, Md Sultanul Islam, et al.
Published: (2024)
An Advanced Deep Learning Based Three-Stream Hybrid Model for Dynamic Hand Gesture Recognition
by: Rahim, Md Abdur, et al.
Published: (2024)
Unsupervised Search for Ethnic Minorities' Medical Segmentation Training Set
by: Chen, Yixiao, et al.
Published: (2025)
BD Open LULC Map: High-resolution land use land cover mapping & benchmarking for urban development in Dhaka, Bangladesh
by: Hossain, Mir Sazzat, et al.
Published: (2025)