:: Library Catalog

Bibliographic Details
Main Authors:	Madan, Chetan; Gupta, Mayuna; Basu, Soumen; Gupta, Pankaj; Arora, Chetan
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.00374

Similar Items

FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders
by: Basu, Soumen, et al.
Published: (2024)

Focus on Texture: Rethinking Pre-training in Masked Autoencoders for Medical Image Classification
by: Madan, Chetan, et al.
Published: (2025)

Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
by: Raj, Ankita, et al.
Published: (2025)

D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms
by: Ashraf, Tajamul, et al.
Published: (2024)

Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks
by: Dutson, Matthew, et al.
Published: (2025)

Gallbladder Cancer Detection in Ultrasound Images based on YOLO and Faster R-CNN
by: Dadjouy, Sara, et al.
Published: (2024)

Q-Adapter: Visual Query Adapter for Extracting Textually-related Features in Video Captioning
by: Chen, Junan, et al.
Published: (2025)

Mimicking Human Visual Development for Learning Robust Image Representations
by: Raj, Ankita, et al.
Published: (2025)

DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection
by: Shao, Rui, et al.
Published: (2023)

LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors
by: Suri, Saksham, et al.
Published: (2024)

Inv-Adapter: ID Customization Generation via Image Inversion and Lightweight Adapter
by: Xing, Peng, et al.
Published: (2024)

ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models
by: Dong, Sibo, et al.
Published: (2025)

Ultrasound SAM Adapter: Adapting SAM for Breast Lesion Segmentation in Ultrasound Images
by: Tu, Zhengzheng, et al.
Published: (2024)

microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
by: Silva, Sathira, et al.
Published: (2025)

ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
by: Patni, Suraj, et al.
Published: (2024)

Identifying Physically Realizable Triggers for Backdoored Face Recognition Networks
by: Raj, Ankita, et al.
Published: (2025)

Examining the Threat Landscape: Foundation Models and Model Stealing
by: Raj, Ankita, et al.
Published: (2025)

DeNAS-ViT: Data Efficient NAS-Optimized Vision Transformer for Ultrasound Image Segmentation
by: Chen, Renqi, et al.
Published: (2024)

I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models
by: Guo, Xun, et al.
Published: (2023)

ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
by: Liu, Jiaming, et al.
Published: (2023)

Reliable Active Learning from Unreliable Labels via Neural Collapse Geometry
by: Goel, Atharv, et al.
Published: (2025)

Your ViT is Secretly an Image Segmentation Model
by: Kerssies, Tommie, et al.
Published: (2025)

Assessing Risk of Stealing Proprietary Models for Medical Imaging Tasks
by: Raj, Ankita, et al.
Published: (2025)

Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP
by: Balasubramanian, Sriram, et al.
Published: (2024)

Towards Fine-Grained Adaptation of CLIP via a Self-Trained Alignment Score
by: Ali, Eman, et al.
Published: (2025)

UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
by: Duan, Lunhao, et al.
Published: (2024)

ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models
by: Cheng, Jiaxiang, et al.
Published: (2024)

Few-Shot Class-Incremental Model Attribution Using Learnable Representation From CLIP-ViT Features
by: Lee, Hanbyul, et al.
Published: (2025)

Prompting without Panic: Attribute-aware, Zero-shot, Test-Time Calibration
by: Hebbalaguppe, Ramya, et al.
Published: (2025)

VA-Adapter: Adapting Ultrasound Foundation Model to Echocardiography Probe Guidance
by: Wang, Teng, et al.
Published: (2025)

FairAdapter: Detecting AI-generated Images with Improved Fairness
by: Ding, Feng, et al.
Published: (2024)

VGGT-SLAM++
by: Mandal, Avilasha, et al.
Published: (2026)

Deeper Inside Deep ViT
by: Hong, Sungrae
Published: (2025)

CLIP-Adapter: Better Vision-Language Models with Feature Adapters
by: Gao, Peng, et al.
Published: (2021)

QueryAdapter: Rapid Adaptation of Vision-Language Models in Response to Natural Language Queries
by: Chapman, Nicolas Harvey, et al.
Published: (2025)

CLIP-Inspector: Model-Level Backdoor Detection for Prompt-Tuned CLIP via OOD Trigger Inversion
by: Jindal, Akshit, et al.
Published: (2026)

I&S-ViT: An Inclusive & Stable Method for Pushing the Limit of Post-Training ViTs Quantization
by: Zhong, Yunshan, et al.
Published: (2023)

How to train your ViT for OOD Detection
by: Mueller, Maximilian, et al.
Published: (2024)

CapS-Adapter: Caption-based MultiModal Adapter in Zero-Shot Classification
by: Wang, Qijie, et al.
Published: (2024)

T-Gated Adapter: A Lightweight Temporal Adapter for Vision-Language Medical Segmentation
by: Khadka, Pranjal
Published: (2026)