Saved in:
| Main Author: | Sabrin, Md. Sanaullah Chowdhury Lameya |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.12652 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
S2M-Net: Spectral-Spatial Mixing for Medical Image Segmentation with Morphology-Aware Adaptive Loss
by: Sabrin, Md. Sanaullah Chowdhury Lameya
Published: (2026)
by: Sabrin, Md. Sanaullah Chowdhury Lameya
Published: (2026)
Med-2D SegNet: A Light Weight Deep Neural Network for Medical 2D Image Segmentation
by: Sabrin, Lameya, et al.
Published: (2025)
by: Sabrin, Lameya, et al.
Published: (2025)
Zero-Shot Fine-Grained Image Classification Using Large Vision-Language Models
by: Atabuzzaman, Md., et al.
Published: (2025)
by: Atabuzzaman, Md., et al.
Published: (2025)
MSRANetV2: An Explainable Deep Learning Architecture for Multi-class Classification of Colorectal Histopathological Images
by: Sarkar, Ovi, et al.
Published: (2025)
by: Sarkar, Ovi, et al.
Published: (2025)
High Resolution Multi-Scale RAFT (Robust Vision Challenge 2022)
by: Jahedi, Azin, et al.
Published: (2022)
by: Jahedi, Azin, et al.
Published: (2022)
HeBA: Heterogeneous Bottleneck Adapters for Robust Vision-Language Models
by: Islam, Md Jahidul
Published: (2026)
by: Islam, Md Jahidul
Published: (2026)
Vision Transformers for End-to-End Quark-Gluon Jet Classification from Calorimeter Images
by: Jahin, Md Abrar, et al.
Published: (2025)
by: Jahin, Md Abrar, et al.
Published: (2025)
In-Depth Analysis of Automated Acne Disease Recognition and Classification
by: Jeny, Afsana Ahsan, et al.
Published: (2025)
by: Jeny, Afsana Ahsan, et al.
Published: (2025)
Deep Neural Network-Based Sign Language Recognition: A Comprehensive Approach Using Transfer Learning with Explainability
by: Ridwan, A. E. M, et al.
Published: (2024)
by: Ridwan, A. E. M, et al.
Published: (2024)
Brain Tumor Classification in MRI Images: A Computationally Efficient Convolutional Neural Network
by: Chowdhury, Md Fahimul Kabir, et al.
Published: (2026)
by: Chowdhury, Md Fahimul Kabir, et al.
Published: (2026)
Towards SAR Automatic Target Recognition MultiCategory SAR Image Classification Based on Light Weight Vision Transformer
by: Zhao, Guibin, et al.
Published: (2024)
by: Zhao, Guibin, et al.
Published: (2024)
VOSR: A Vision-Only Generative Model for Image Super-Resolution
by: Wu, Rongyuan, et al.
Published: (2026)
by: Wu, Rongyuan, et al.
Published: (2026)
A Robust Deep Learning Framework for Bangla License Plate Recognition Using YOLO and Vision-Language OCR
by: Hasin, Nayeb, et al.
Published: (2026)
by: Hasin, Nayeb, et al.
Published: (2026)
Feature Coding for Scalable Machine Vision
by: Eimon, Md Eimran Hossain, et al.
Published: (2025)
by: Eimon, Md Eimran Hossain, et al.
Published: (2025)
CLFace: A Scalable and Resource-Efficient Continual Learning Framework for Lifelong Face Recognition
by: Hasan, Md Mahedi, et al.
Published: (2024)
by: Hasan, Md Mahedi, et al.
Published: (2024)
MangoLeafViT: Leveraging Lightweight Vision Transformer with Runtime Augmentation for Efficient Mango Leaf Disease Classification
by: Chowdhury, Rafi Hassan, et al.
Published: (2025)
by: Chowdhury, Rafi Hassan, et al.
Published: (2025)
DSVTLA: Deep Swin Vision Transformer-Based Transfer Learning Architecture for Multi-Type Cancer Histopathological Cancer Image Classification
by: Khan, Muazzem Hussain, et al.
Published: (2026)
by: Khan, Muazzem Hussain, et al.
Published: (2026)
InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis
by: Han, Tao, et al.
Published: (2025)
by: Han, Tao, et al.
Published: (2025)
Real-Time Multi-Modal Embedded Vision Framework for Object Detection Facial Emotion Recognition and Biometric Identification on Low-Power Edge Platforms
by: Zahid, S. M. Khalid Bin, et al.
Published: (2026)
by: Zahid, S. M. Khalid Bin, et al.
Published: (2026)
Maximizing Generalization: The Effect of Different Augmentation Techniques on Lightweight Vision Transformer for Bengali Character Classification
by: Chowdhury, Rafi Hassan, et al.
Published: (2026)
by: Chowdhury, Rafi Hassan, et al.
Published: (2026)
In-Model Merging for Enhancing the Robustness of Medical Imaging Classification Models
by: Wang, Hu, et al.
Published: (2025)
by: Wang, Hu, et al.
Published: (2025)
Robustness of Vision Language Models Against Split-Image Harmful Input Attacks
by: Rashid, Md Rafi Ur, et al.
Published: (2026)
by: Rashid, Md Rafi Ur, et al.
Published: (2026)
Neural Network-based Study for Rice Leaf Disease Recognition and Classification: A Comparative Analysis Between Feature-based Model and Direct Imaging Model
by: Prity, Farida Siddiqi, et al.
Published: (2025)
by: Prity, Farida Siddiqi, et al.
Published: (2025)
Large Language Models Facilitate Vision Reflection in Image Classification
by: An, Guoyuan, et al.
Published: (2025)
by: An, Guoyuan, et al.
Published: (2025)
Beyond Image Super-Resolution for Image Recognition with Task-Driven Perceptual Loss
by: Kim, Jaeha, et al.
Published: (2024)
by: Kim, Jaeha, et al.
Published: (2024)
Image Recognition with Vision and Language Embeddings of VLMs
by: Volkov, Illia, et al.
Published: (2025)
by: Volkov, Illia, et al.
Published: (2025)
MedSR-Vision: Deep Learning Framework for Multi-Domain Medical Image Super-Resolution
by: Gurappa, Subhash, et al.
Published: (2026)
by: Gurappa, Subhash, et al.
Published: (2026)
Efficient Domain-Adaptive Multi-Task Dense Prediction with Vision Foundation Models
by: Kang, Beomseok, et al.
Published: (2025)
by: Kang, Beomseok, et al.
Published: (2025)
CAST: Channel-Aware Spatial Transfer Learning with Pseudo-Image Radar for Sign Language Recognition
by: Shujon, Md. Shakhoyat Rahman, et al.
Published: (2026)
by: Shujon, Md. Shakhoyat Rahman, et al.
Published: (2026)
LOOPE: Learnable Optimal Patch Order in Positional Embeddings for Vision Transformers
by: Chowdhury, Md Abtahi Majeed, et al.
Published: (2025)
by: Chowdhury, Md Abtahi Majeed, et al.
Published: (2025)
An IoT-Enabled Smart Aquarium System for Real-Time Water Quality Monitoring and Automated Feeding
by: Ayon, MD Fatin Ishraque, et al.
Published: (2026)
by: Ayon, MD Fatin Ishraque, et al.
Published: (2026)
DCT-Shield: A Robust Frequency Domain Defense against Malicious Image Editing
by: Bala, Aniruddha, et al.
Published: (2025)
by: Bala, Aniruddha, et al.
Published: (2025)
Understanding Robustness of Visual State Space Models for Image Classification
by: Du, Chengbin, et al.
Published: (2024)
by: Du, Chengbin, et al.
Published: (2024)
Image Recognition with Online Lightweight Vision Transformer: A Survey
by: Zhang, Zherui, et al.
Published: (2025)
by: Zhang, Zherui, et al.
Published: (2025)
Revisiting 2D Foundation Models for Scalable 3D Medical Image Classification
by: Liu, Han, et al.
Published: (2025)
by: Liu, Han, et al.
Published: (2025)
PEFT A2Z: Parameter-Efficient Fine-Tuning Survey for Large Language and Vision Models
by: Prottasha, Nusrat Jahan, et al.
Published: (2025)
by: Prottasha, Nusrat Jahan, et al.
Published: (2025)
GastroViT: A Vision Transformer Based Ensemble Learning Approach for Gastrointestinal Disease Classification with Grad CAM & SHAP Visualization
by: Tabassum, Sumaiya, et al.
Published: (2025)
by: Tabassum, Sumaiya, et al.
Published: (2025)
JaiLIP: Jailbreaking Vision-Language Models via Loss Guided Image Perturbation
by: Mia, Md Jueal, et al.
Published: (2025)
by: Mia, Md Jueal, et al.
Published: (2025)
Read or Ignore? A Unified Benchmark for Typographic-Attack Robustness and Text Recognition in Vision-Language Models
by: Waseda, Futa, et al.
Published: (2025)
by: Waseda, Futa, et al.
Published: (2025)
Accelerating Image Super-Resolution Networks with Pixel-Level Classification
by: Jeong, Jinho, et al.
Published: (2024)
by: Jeong, Jinho, et al.
Published: (2024)
Similar Items
-
S2M-Net: Spectral-Spatial Mixing for Medical Image Segmentation with Morphology-Aware Adaptive Loss
by: Sabrin, Md. Sanaullah Chowdhury Lameya
Published: (2026) -
Med-2D SegNet: A Light Weight Deep Neural Network for Medical 2D Image Segmentation
by: Sabrin, Lameya, et al.
Published: (2025) -
Zero-Shot Fine-Grained Image Classification Using Large Vision-Language Models
by: Atabuzzaman, Md., et al.
Published: (2025) -
MSRANetV2: An Explainable Deep Learning Architecture for Multi-class Classification of Colorectal Histopathological Images
by: Sarkar, Ovi, et al.
Published: (2025) -
High Resolution Multi-Scale RAFT (Robust Vision Challenge 2022)
by: Jahedi, Azin, et al.
Published: (2022)