| Main Authors: | Alomar, Khaled; Aysel, Halil Ibrahim; Cai, Xiaohao |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2407.06162 |
Similar Items
Semantic Segmentation by Semantic Proportions
by: Aysel, Halil Ibrahim, et al.
Published: (2023)
VORTEX: Challenging CNNs at Texture Recognition by using Vision Transformers with Orderless and Randomized Token Encodings
by: Scabini, Leonardo, et al.
Published: (2025)
Concept-Based Explainable Artificial Intelligence: Metrics and Benchmarks
by: Aysel, Halil Ibrahim, et al.
Published: (2025)
TraNCE: Transformative Non-linear Concept Explainer for CNNs
by: Akpudo, Ugochukwu Ejike, et al.
Published: (2025)
Real-Time Human Action Recognition on Embedded Platforms
by: Wang, Ruiqi, et al.
Published: (2024)
Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers
by: Hatefi, Sayed Mohammad Vakilzadeh, et al.
Published: (2024)
Transformer-based Models to Deal with Heterogeneous Environments in Human Activity Recognition
by: EK, Sannara, et al.
Published: (2022)
Do Transformers Understand Ancient Roman Coin Motifs Better than CNNs?
by: Reid, David, et al.
Published: (2026)
Explaining Model Overfitting in CNNs via GMM Clustering
by: Dou, Hui, et al.
Published: (2024)
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives
by: Seweryn, Karolina, et al.
Published: (2023)
Grounding Foundational Vision Models with 3D Human Poses for Robust Action Recognition
by: Babey, Nicholas, et al.
Published: (2025)
Efficient Hyperparameter Importance Assessment for CNNs
by: Wang, Ruinan, et al.
Published: (2024)
CNNs Avoid Curse of Dimensionality by Learning on Patches
by: Madala, Vamshi C., et al.
Published: (2022)
Explaining with trees: interpreting CNNs using hierarchies
by: Rodrigues, Caroline Mazini, et al.
Published: (2024)
Detecção da Psoríase Utilizando Visão Computacional: Uma Abordagem Comparativa Entre CNNs e Vision Transformers
by: Lucena, Natanael, et al.
Published: (2025)
SkelVIT: Consensus of Vision Transformers for a Lightweight Skeleton-Based Action Recognition System
by: Karadag, Ozge Oztimur
Published: (2023)
From Ground to Air: Noise Robustness in Vision Transformers and CNNs for Event-Based Vehicle Classification with Potential UAV Applications
by: Almesafri, Nouf, et al.
Published: (2025)
Feature Hallucination for Self-supervised Action Recognition
by: Wang, Lei, et al.
Published: (2025)
Evolving Skeletons: Motion Dynamics in Action Recognition
by: Qiu, Jushang, et al.
Published: (2025)
Hybrid Training for Vision-Language-Action Models
by: Mazzaglia, Pietro, et al.
Published: (2025)
Evaluating the Stability of Semantic Concept Representations in CNNs for Robust Explainability
by: Mikriukov, Georgii, et al.
Published: (2023)
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
by: Tang, Haotian, et al.
Published: (2024)
A Survey on Efficient Vision-Language-Action Models
by: Yu, Zhaoshu, et al.
Published: (2025)
Reliable Evaluation of Attribution Maps in CNNs: A Perturbation-Based Approach
by: Nieradzik, Lars, et al.
Published: (2024)
Stacked Ensemble of Fine-Tuned CNNs for Knee Osteoarthritis Severity Grading
by: Gupta, Adarsh, et al.
Published: (2025)
SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders
by: Martinel, Niki, et al.
Published: (2024)
Skeleton-based Action Recognition with Non-linear Dependency Modeling and Hilbert-Schmidt Independence Criterion
by: Yang, Yuheng
Published: (2024)
Multimodal Attack Detection for Action Recognition Models
by: Mumcu, Furkan, et al.
Published: (2024)
A Comparative Study of Custom CNNs, Pre-trained Models, and Transfer Learning Across Multiple Visual Datasets
by: Akhand, Annoor Sharara
Published: (2026)
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation
by: Li, Wenhao, et al.
Published: (2023)
Pose Matters: Evaluating Vision Transformers and CNNs for Human Action Recognition on Small COCO Subsets
by: Tang, MingZe, et al.
Published: (2025)
Hypergraph-based Multi-View Action Recognition using Event Cameras
by: Gao, Yue, et al.
Published: (2024)
One-Frame Calibration with Siamese Network in Facial Action Unit Recognition
by: Feng, Shuangquan, et al.
Published: (2024)
EITNet: An IoT-Enhanced Framework for Real-Time Basketball Action Recognition
by: Liu, Jingyu, et al.
Published: (2024)
Federated Learning for Video Violence Detection: Complementary Roles of Lightweight CNNs and Vision-Language Models for Energy-Efficient Use
by: Thuau, Sébastien, et al.
Published: (2025)
Beyond Conventional Transformers: The Medical X-ray Attention (MXA) Block for Improved Multi-Label Diagnosis Using Knowledge Distillation
by: Rand, Amit, et al.
Published: (2025)
Bird Eye-View to Street-View: A Survey
by: Bajbaa, Khawlah, et al.
Published: (2024)
Advanced Arabic Alphabet Sign Language Recognition Using Transfer Learning and Transformer Models
by: Balat, Mazen, et al.
Published: (2024)
A Survey of Deep Learning for Group-level Emotion Recognition
by: Huang, Xiaohua, et al.
Published: (2024)
Large-Scale Evaluation of Open-Set Image Classification Techniques
by: Bisgin, Halil, et al.
Published: (2024)