:: Library Catalog

Bibliographic Details
Main Authors:	Li, Xueyang; Wang, Zongren; Zhang, Yuliang; Pan, Zixuan; Chen, Yu-Jen; Sapkota, Nishchal; Xu, Gelei; Chen, Danny Z.; Shi, Yiyu
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.13869

Similar Items

SwIPE: Efficient and Robust Medical Image Segmentation with Implicit Patch Embeddings
by: Zhang, Yejia, et al.
Published: (2023)

Path-GPTOmic: A Balanced Multi-modal Learning Framework for Survival Outcome Prediction
by: Wang, Hongxiao, et al.
Published: (2024)

AT-CXR: Uncertainty-Aware Agentic Triage for Chest X-rays
by: Li, Xueyang, et al.
Published: (2025)

RepViT: Revisiting Mobile CNN From ViT Perspective
by: Wang, Ao, et al.
Published: (2023)

CNN-ViT Fusion with Adaptive Attention Gate for Brain Tumor MRI Classification: A Hybrid Deep Learning Model
by: Hasnain, Syed Ibad, et al.
Published: (2026)

Attention-enabled Explainable AI for Bladder Cancer Recurrence Prediction
by: Abbas, Saram, et al.
Published: (2025)

ViT Registers and Fractal ViT
by: Chou, Jason Chuan-Chih, et al.
Published: (2026)

MIPHEI-ViT: Multiplex Immunofluorescence Prediction from H&E Images using ViT Foundation Models
by: Balezo, Guillaume, et al.
Published: (2025)

MedSAM-CA: A CNN-Augmented ViT with Attention-Enhanced Multi-Scale Fusion for Medical Image Segmentation
by: Tian, Peiting, et al.
Published: (2025)

Unsupervised Out-of-Distribution Detection in Medical Imaging Using Multi-Exit Class Activation Maps and Feature Masking
by: Chen, Yu-Jen, et al.
Published: (2025)

ODE-ViT: Plug & Play Attention Layer from the Generalization of the ViT as an Ordinary Differential Equation
by: Riera, Carlos Boned, et al.
Published: (2025)

On the Role of ViT and CNN in Semantic Communications: Analysis and Prototype Validation
by: Yoo, Hanju, et al.
Published: (2023)

Boosting Medical Image Classification with Segmentation Foundation Model
by: Gu, Pengfei, et al.
Published: (2024)

When Swin Transformer Meets KANs: An Improved Transformer Architecture for Medical Image Segmentation
by: Sapkota, Nishchal, et al.
Published: (2025)

Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration
by: Rashid, Umar, et al.
Published: (2025)

Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking
by: Kang, Ben, et al.
Published: (2025)

A Hybrid Framework Bridging CNN and ViT based on Theory of Evidence for Diabetic Retinopathy Grading
by: Qiu, Junlai, et al.
Published: (2025)

I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization
by: Zhong, Yunshan, et al.
Published: (2023)

Masked Diffusion as Self-supervised Representation Learner
by: Pan, Zixuan, et al.
Published: (2023)

Combined CNN and ViT features off-the-shelf: Another astounding baseline for recognition
by: Alonso-Fernandez, Fernando, et al.
Published: (2024)

CNN and ViT Efficiency Study on Tiny ImageNet and DermaMNIST Datasets
by: Amangeldi, Aidar, et al.
Published: (2025)

ViT$^3$: Unlocking Test-Time Training in Vision
by: Han, Dongchen, et al.
Published: (2025)

FCL-ViT: Task-Aware Attention Tuning for Continual Learning
by: Kaimakamidis, Anestis, et al.
Published: (2024)

ViT-AdaLA: Adapting Vision Transformers with Linear Attention
by: Li, Yifan, et al.
Published: (2026)

AE-ViT: Token Enhancement for Vision Transformers via CNN-Based Autoencoder Ensembles
by: AIRCC
Published: (2025)

Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
by: Gao, Xiangyu, et al.
Published: (2025)

CNN-ViT Hybrid for Pneumonia Detection: Theory and Empiric on Limited Data without Pretraining
by: Basnet, Prashant Singh, et al.
Published: (2025)

Surface Defect Identification of Strip Steel Using ViT‐RepVGG
by: Zhihuan Wang, et al.
Published: (2024)

SHMC-Net: A Mask-guided Feature Fusion Network for Sperm Head Morphology Classification
by: Sapkota, Nishchal, et al.
Published: (2024)

LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention
by: Zhang, Jiangling, et al.
Published: (2025)

An Empirical Study of Agent Skills for Healthcare: Practice, Gaps, and Governance
by: Xu, Gelei, et al.
Published: (2026)

Incorporating Rather Than Eliminating: Achieving Fairness for Skin Disease Diagnosis Through Group-Specific Expert
by: Xu, Gelei, et al.
Published: (2025)

CNN+ViT+SigLiP: A Real-Time Framework for Smart Urban Mobility
by: Chowdhury, Koushik
Published: (2025)

Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation
by: Ngo, Ba Hung, et al.
Published: (2024)

VTCNet: A Feature Fusion DL Model Based on CNN and ViT for the Classification of Cervical Cells
by: Mingzhe Li, et al.
Published: (2024)

Alias-Free ViT: Fractional Shift Invariance via Linear Attention
by: Michaeli, Hagay, et al.
Published: (2025)

Deeper Inside Deep ViT
by: Hong, Sungrae
Published: (2025)

Exploring Pembrolizumab‐Induced IM3OS in a Patient With Bladder Cancer
by: Ching‐Cheng Chan, et al.
Published: (2026)

CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs
by: Ramachandran, Akshat, et al.
Published: (2024)

STRAP-ViT: Segregated Tokens with Randomized -- Transformations for Defense against Adversarial Patches in ViTs
by: Chattopadhyay, Nandish, et al.
Published: (2026)