| Main Authors: | Madan, Chetan; Gupta, Mayuna; Basu, Soumen; Gupta, Pankaj; Arora, Chetan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.00374 |
Similar Items
FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders
by: Basu, Soumen, et al.
Published: (2024)
Focus on Texture: Rethinking Pre-training in Masked Autoencoders for Medical Image Classification
by: Madan, Chetan, et al.
Published: (2025)
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
by: Raj, Ankita, et al.
Published: (2025)
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms
by: Ashraf, Tajamul, et al.
Published: (2024)
Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks
by: Dutson, Matthew, et al.
Published: (2025)
Gallbladder Cancer Detection in Ultrasound Images based on YOLO and Faster R-CNN
by: Dadjouy, Sara, et al.
Published: (2024)
Q-Adapter: Visual Query Adapter for Extracting Textually-related Features in Video Captioning
by: Chen, Junan, et al.
Published: (2025)
Mimicking Human Visual Development for Learning Robust Image Representations
by: Raj, Ankita, et al.
Published: (2025)
DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection
by: Shao, Rui, et al.
Published: (2023)
LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
by: Suri, Saksham, et al.
Published: (2024)
Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter
by: Xing, Peng, et al.
Published: (2024)
ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models
by: Dong, Sibo, et al.
Published: (2025)
Ultrasound SAM Adapter: Adapting SAM for Breast Lesion Segmentation in Ultrasound Images
by: Tu, Zhengzheng, et al.
Published: (2024)
microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
by: Silva, Sathira, et al.
Published: (2025)
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
by: Patni, Suraj, et al.
Published: (2024)
Identifying Physically Realizable Triggers for Backdoored Face Recognition Networks
by: Raj, Ankita, et al.
Published: (2025)
Examining the Threat Landscape: Foundation Models and Model Stealing
by: Raj, Ankita, et al.
Published: (2025)
DeNAS-ViT: Data Efficient NAS-Optimized Vision Transformer for Ultrasound Image Segmentation
by: Chen, Renqi, et al.
Published: (2024)
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
by: Guo, Xun, et al.
Published: (2023)
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
by: Liu, Jiaming, et al.
Published: (2023)
Reliable Active Learning from Unreliable Labels via Neural Collapse Geometry
by: Goel, Atharv, et al.
Published: (2025)
Your ViT is Secretly an Image Segmentation Model
by: Kerssies, Tommie, et al.
Published: (2025)
Assessing Risk of Stealing Proprietary Models for Medical Imaging Tasks
by: Raj, Ankita, et al.
Published: (2025)
Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP
by: Balasubramanian, Sriram, et al.
Published: (2024)
Towards Fine-Grained Adaptation of CLIP via a Self-Trained Alignment Score
by: Ali, Eman, et al.
Published: (2025)
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
by: Duan, Lunhao, et al.
Published: (2024)
ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
by: Cheng, Jiaxiang, et al.
Published: (2024)
Few-Shot Class-Incremental Model Attribution Using Learnable Representation From CLIP-ViT Features
by: Lee, Hanbyul, et al.
Published: (2025)
Prompting without Panic: Attribute-aware, Zero-shot, Test-Time Calibration
by: Hebbalaguppe, Ramya, et al.
Published: (2025)
VA-Adapter: Adapting Ultrasound Foundation Model to Echocardiography Probe Guidance
by: Wang, Teng, et al.
Published: (2025)
FairAdapter: Detecting AI-generated Images with Improved Fairness
by: Ding, Feng, et al.
Published: (2024)
VGGT-SLAM++
by: Mandal, Avilasha, et al.
Published: (2026)
Deeper Inside Deep ViT
by: Hong, Sungrae
Published: (2025)
CLIP-Adapter: Better Vision-Language Models with Feature Adapters
by: Gao, Peng, et al.
Published: (2021)
QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries
by: Chapman, Nicolas Harvey, et al.
Published: (2025)
CLIP-Inspector: Model-Level Backdoor Detection for Prompt-Tuned CLIP via OOD Trigger Inversion
by: Jindal, Akshit, et al.
Published: (2026)
I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization
by: Zhong, Yunshan, et al.
Published: (2023)
How to train your ViT for OOD Detection
by: Mueller, Maximilian, et al.
Published: (2024)
CapS-Adapter: Caption-based MultiModal Adapter in Zero-Shot Classification
by: Wang, Qijie, et al.
Published: (2024)
T-Gated Adapter: A Lightweight Temporal Adapter for Vision-Language Medical Segmentation
by: Khadka, Pranjal
Published: (2026)