| Main Authors: | Li, Xueyang; Wang, Zongren; Zhang, Yuliang; Pan, Zixuan; Chen, Yu-Jen; Sapkota, Nishchal; Xu, Gelei; Chen, Danny Z.; Shi, Yiyu |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.13869 |
Similar Items
SwIPE: Efficient and Robust Medical Image Segmentation with Implicit Patch Embeddings
by: Zhang, Yejia, et al.
Published: (2023)
Path-GPTOmic: A Balanced Multi-modal Learning Framework for Survival Outcome Prediction
by: Wang, Hongxiao, et al.
Published: (2024)
AT-CXR: Uncertainty-Aware Agentic Triage for Chest X-rays
by: Li, Xueyang, et al.
Published: (2025)
RepViT: Revisiting Mobile CNN From ViT Perspective
by: Wang, Ao, et al.
Published: (2023)
CNN-ViT Fusion with Adaptive Attention Gate for Brain Tumor MRI Classification: A Hybrid Deep Learning Model
by: Hasnain, Syed Ibad, et al.
Published: (2026)
Attention-enabled Explainable AI for Bladder Cancer Recurrence Prediction
by: Abbas, Saram, et al.
Published: (2025)
ViT Registers and Fractal ViT
by: Chou, Jason Chuan-Chih, et al.
Published: (2026)
MIPHEI-ViT: Multiplex Immunofluorescence Prediction from H&E Images using ViT Foundation Models
by: Balezo, Guillaume, et al.
Published: (2025)
MedSAM-CA: A CNN-Augmented ViT with Attention-Enhanced Multi-Scale Fusion for Medical Image Segmentation
by: Tian, Peiting, et al.
Published: (2025)
Unsupervised Out-of-Distribution Detection in Medical Imaging Using Multi-Exit Class Activation Maps and Feature Masking
by: Chen, Yu-Jen, et al.
Published: (2025)
ODE-ViT: Plug & Play Attention Layer from the Generalization of the ViT as an Ordinary Differential Equation
by: Riera, Carlos Boned, et al.
Published: (2025)
On the Role of ViT and CNN in Semantic Communications: Analysis and Prototype Validation
by: Yoo, Hanju, et al.
Published: (2023)
Boosting Medical Image Classification with Segmentation Foundation Model
by: Gu, Pengfei, et al.
Published: (2024)
When Swin Transformer Meets KANs: An Improved Transformer Architecture for Medical Image Segmentation
by: Sapkota, Nishchal, et al.
Published: (2025)
Hybrid CNN-ViT Framework for Motion-Blurred Scene Text Restoration
by: Rashid, Umar, et al.
Published: (2025)
Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking
by: Kang, Ben, et al.
Published: (2025)
A Hybrid Framework Bridging CNN and ViT based on Theory of Evidence for Diabetic Retinopathy Grading
by: Qiu, Junlai, et al.
Published: (2025)
I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization
by: Zhong, Yunshan, et al.
Published: (2023)
Masked Diffusion as Self-supervised Representation Learner
by: Pan, Zixuan, et al.
Published: (2023)
Combined CNN and ViT features off-the-shelf: Another astounding baseline for recognition
by: Alonso-Fernandez, Fernando, et al.
Published: (2024)
CNN and ViT Efficiency Study on Tiny ImageNet and DermaMNIST Datasets
by: Amangeldi, Aidar, et al.
Published: (2025)
ViT$^3$: Unlocking Test-Time Training in Vision
by: Han, Dongchen, et al.
Published: (2025)
FCL-ViT: Task-Aware Attention Tuning for Continual Learning
by: Kaimakamidis, Anestis, et al.
Published: (2024)
ViT-AdaLA: Adapting Vision Transformers with Linear Attention
by: Li, Yifan, et al.
Published: (2026)
AE-ViT: Token Enhancement for Vision Transformers via CNN-Based Autoencoder Ensembles
by: AIRCC
Published: (2025)
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
by: Gao, Xiangyu, et al.
Published: (2025)
CNN-ViT Hybrid for Pneumonia Detection: Theory and Empiric on Limited Data without Pretraining
by: Basnet, Prashant Singh, et al.
Published: (2025)
Surface Defect Identification of Strip Steel Using ViT‐RepVGG
by: Zhihuan Wang, et al.
Published: (2024)
SHMC-Net: A Mask-guided Feature Fusion Network for Sperm Head Morphology Classification
by: Sapkota, Nishchal, et al.
Published: (2024)
LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention
by: Zhang, Jiangling, et al.
Published: (2025)
An Empirical Study of Agent Skills for Healthcare: Practice, Gaps, and Governance
by: Xu, Gelei, et al.
Published: (2026)
Incorporating Rather Than Eliminating: Achieving Fairness for Skin Disease Diagnosis Through Group-Specific Expert
by: Xu, Gelei, et al.
Published: (2025)
CNN+ViT+SigLiP: A Real-Time Framework for Smart Urban Mobility
by: Chowdhury, Koushik
Published: (2025)
Learning CNN on ViT: A Hybrid Model to Explicitly Class-specific Boundaries for Domain Adaptation
by: Ngo, Ba Hung, et al.
Published: (2024)
VTCNet: A Feature Fusion DL Model Based on CNN and ViT for the Classification of Cervical Cells
by: Mingzhe Li, et al.
Published: (2024)
Alias-Free ViT: Fractional Shift Invariance via Linear Attention
by: Michaeli, Hagay, et al.
Published: (2025)
Deeper Inside Deep ViT
by: Hong, Sungrae
Published: (2025)
Exploring Pembrolizumab‐Induced IM3OS in a Patient With Bladder Cancer
by: Ching‐Cheng Chan, et al.
Published: (2026)
CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs
by: Ramachandran, Akshat, et al.
Published: (2024)
STRAP-ViT: Segregated Tokens with Randomized -- Transformations for Defense against Adversarial Patches in ViTs
by: Chattopadhyay, Nandish, et al.
Published: (2026)