Saved in:
| Main Authors: | Inkawhich, Matthew, Inkawhich, Nathan, Yang, Hao, Zhang, Jingyang, Linderman, Randolph, Chen, Yiran |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.10865 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Tunable Hybrid Proposal Networks for the Open World
by: Inkawhich, Matthew, et al.
Published: (2022)
by: Inkawhich, Matthew, et al.
Published: (2022)
On the Status of Foundation Models for SAR Imagery
by: Inkawhich, Nathan
Published: (2025)
by: Inkawhich, Nathan
Published: (2025)
Comprehensive OOD Detection Improvements
by: Lakkapragada, Anish, et al.
Published: (2024)
by: Lakkapragada, Anish, et al.
Published: (2024)
Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble
by: Xu, Chenhui, et al.
Published: (2024)
by: Xu, Chenhui, et al.
Published: (2024)
Multi-layer Radial Basis Function Networks for Out-of-distribution Detection
by: Khanna, Amol, et al.
Published: (2025)
by: Khanna, Amol, et al.
Published: (2025)
Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection
by: Gao, Xiangyu, et al.
Published: (2025)
by: Gao, Xiangyu, et al.
Published: (2025)
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
by: Suri, Saksham, et al.
Published: (2024)
by: Suri, Saksham, et al.
Published: (2024)
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object Detection
by: Aubard, Martin, et al.
Published: (2024)
by: Aubard, Martin, et al.
Published: (2024)
Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection
by: Zhang, Jiangning, et al.
Published: (2023)
by: Zhang, Jiangning, et al.
Published: (2023)
I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization
by: Zhong, Yunshan, et al.
Published: (2023)
by: Zhong, Yunshan, et al.
Published: (2023)
RepViT: Revisiting Mobile CNN From ViT Perspective
by: Wang, Ao, et al.
Published: (2023)
by: Wang, Ao, et al.
Published: (2023)
Revisiting Token Compression for Accelerating ViT-based Sparse Multi-View 3D Object Detectors
by: Ji, Mingqian, et al.
Published: (2026)
by: Ji, Mingqian, et al.
Published: (2026)
Deeper Inside Deep ViT
by: Hong, Sungrae
Published: (2025)
by: Hong, Sungrae
Published: (2025)
A Hybrid Framework Bridging CNN and ViT based on Theory of Evidence for Diabetic Retinopathy Grading
by: Qiu, Junlai, et al.
Published: (2025)
by: Qiu, Junlai, et al.
Published: (2025)
SPAR: Single-Pass Any-Resolution ViT for Open-vocabulary Segmentation
by: Kombol, Naomi, et al.
Published: (2026)
by: Kombol, Naomi, et al.
Published: (2026)
How to train your ViT for OOD Detection
by: Mueller, Maximilian, et al.
Published: (2024)
by: Mueller, Maximilian, et al.
Published: (2024)
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
by: Zhu, Chen, et al.
Published: (2025)
by: Zhu, Chen, et al.
Published: (2025)
UADet: A Remarkably Simple Yet Effective Uncertainty-Aware Open-Set Object Detection Framework
by: Cheng, Silin, et al.
Published: (2024)
by: Cheng, Silin, et al.
Published: (2024)
Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey
by: Siméoni, Oriane, et al.
Published: (2023)
by: Siméoni, Oriane, et al.
Published: (2023)
ViT-5: Vision Transformers for The Mid-2020s
by: Wang, Feng, et al.
Published: (2026)
by: Wang, Feng, et al.
Published: (2026)
LAMM-ViT: AI Face Detection via Layer-Aware Modulation of Region-Guided Attention
by: Zhang, Jiangling, et al.
Published: (2025)
by: Zhang, Jiangling, et al.
Published: (2025)
STRAP-ViT: Segregated Tokens with Randomized -- Transformations for Defense against Adversarial Patches in ViTs
by: Chattopadhyay, Nandish, et al.
Published: (2026)
by: Chattopadhyay, Nandish, et al.
Published: (2026)
Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation
by: Tang, Fenghe, et al.
Published: (2025)
by: Tang, Fenghe, et al.
Published: (2025)
Scene-Graph ViT: End-to-End Open-Vocabulary Visual Relationship Detection
by: Salzmann, Tim, et al.
Published: (2024)
by: Salzmann, Tim, et al.
Published: (2024)
ViTCAE: ViT-based Class-conditioned Autoencoder
by: Jebraeeli, Vahid, et al.
Published: (2025)
by: Jebraeeli, Vahid, et al.
Published: (2025)
Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking
by: Kang, Ben, et al.
Published: (2025)
by: Kang, Ben, et al.
Published: (2025)
Rethinking Random Masking in Self-Distillation on ViT
by: Seong, Jihyeon, et al.
Published: (2025)
by: Seong, Jihyeon, et al.
Published: (2025)
YOLO-Former: YOLO Shakes Hand With ViT
by: Khoramdel, Javad, et al.
Published: (2024)
by: Khoramdel, Javad, et al.
Published: (2024)
Your ViT is Secretly an Image Segmentation Model
by: Kerssies, Tommie, et al.
Published: (2025)
by: Kerssies, Tommie, et al.
Published: (2025)
LogitDynamics: Reliable ViT Error Detection from Layerwise Logit Trajectories
by: Beigelman, Ido, et al.
Published: (2026)
by: Beigelman, Ido, et al.
Published: (2026)
U-REPA: Aligning Diffusion U-Nets to ViTs
by: Tian, Yuchuan, et al.
Published: (2025)
by: Tian, Yuchuan, et al.
Published: (2025)
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation
by: Zhao, Wangbo, et al.
Published: (2024)
by: Zhao, Wangbo, et al.
Published: (2024)
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications
by: Zhang, Tianfang, et al.
Published: (2024)
by: Zhang, Tianfang, et al.
Published: (2024)
Intriguing Frequency Interpretation of Adversarial Robustness for CNNs and ViTs
by: Chen, Lu, et al.
Published: (2025)
by: Chen, Lu, et al.
Published: (2025)
Purrturbed but Stable: Human-Cat Invariant Representations Across CNNs, ViTs and Self-Supervised ViTs
by: Shah, Arya, et al.
Published: (2025)
by: Shah, Arya, et al.
Published: (2025)
A Hybrid CNN-ViT-GNN Framework with GAN-Based Augmentation for Intelligent Weed Detection in Precision Agriculture
by: V, Pandiyaraju, et al.
Published: (2025)
by: V, Pandiyaraju, et al.
Published: (2025)
ViT-Lens: Towards Omni-modal Representations
by: Lei, Weixian, et al.
Published: (2023)
by: Lei, Weixian, et al.
Published: (2023)
Harnessing the Computation Redundancy in ViTs to Boost Adversarial Transferability
by: Liu, Jiani, et al.
Published: (2025)
by: Liu, Jiani, et al.
Published: (2025)
Multimodal Informative ViT: Information Aggregation and Distribution for Hyperspectral and LiDAR Classification
by: Zhang, Jiaqing, et al.
Published: (2024)
by: Zhang, Jiaqing, et al.
Published: (2024)
Vanilla ViT for Automotive Point Cloud Semantic Segmentation
by: Puy, Gilles, et al.
Published: (2026)
by: Puy, Gilles, et al.
Published: (2026)
Similar Items
-
Tunable Hybrid Proposal Networks for the Open World
by: Inkawhich, Matthew, et al.
Published: (2022) -
On the Status of Foundation Models for SAR Imagery
by: Inkawhich, Nathan
Published: (2025) -
Comprehensive OOD Detection Improvements
by: Lakkapragada, Anish, et al.
Published: (2024) -
Out-of-Distribution Detection via Deep Multi-Comprehension Ensemble
by: Xu, Chenhui, et al.
Published: (2024) -
Multi-layer Radial Basis Function Networks for Out-of-distribution Detection
by: Khanna, Amol, et al.
Published: (2025)