| Main Authors: | Raj, Ankita, Prajaapat, Kaashika, Gandhi, Tapan Kumar, Arora, Chetan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.14360 |
Similar Items
Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
by: Raj, Ankita, et al.
Published: (2025)
Identifying Physically Realizable Triggers for Backdoored Face Recognition Networks
by: Raj, Ankita, et al.
Published: (2025)
Examining the Threat Landscape: Foundation Models and Model Stealing
by: Raj, Ankita, et al.
Published: (2025)
Assessing Risk of Stealing Proprietary Models for Medical Imaging Tasks
by: Raj, Ankita, et al.
Published: (2025)
CrossMed: A Multimodal Cross-Task Benchmark for Compositional Generalization in Medical Imaging
by: Singh, Pooja, et al.
Published: (2025)
LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Image
by: Madan, Chetan, et al.
Published: (2024)
VGGT-SLAM++
by: Mandal, Avilasha, et al.
Published: (2026)
Focus on Texture: Rethinking Pre-training in Masked Autoencoders for Medical Image Classification
by: Madan, Chetan, et al.
Published: (2025)
Feature Space Perturbation: A Panacea to Enhanced Transferability Estimation
by: Khoba, Prafful Kumar, et al.
Published: (2025)
Reliable Active Learning from Unreliable Labels via Neural Collapse Geometry
by: Goel, Atharv, et al.
Published: (2025)
microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
by: Silva, Sathira, et al.
Published: (2025)
Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers
by: Gandhi, Sanket, et al.
Published: (2024)
Use of Metric Learning for the Recognition of Handwritten Digits, and its Application to Increase the Outreach of Voice-based Communication Platforms
by: Pant, Devesh, et al.
Published: (2025)
FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders
by: Basu, Soumen, et al.
Published: (2024)
Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking
by: Liu, Yun, et al.
Published: (2024)
Towards Fine-Grained Adaptation of CLIP via a Self-Trained Alignment Score
by: Ali, Eman, et al.
Published: (2025)
WavShadow: Wavelet Based Shadow Segmentation and Removal
by: Jain, Shreyans, et al.
Published: (2024)
Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time
by: Berger, Uri, et al.
Published: (2025)
D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms
by: Ashraf, Tajamul, et al.
Published: (2024)
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
by: Patni, Suraj, et al.
Published: (2024)
Personalized Image Generation from an Author Writing Style
by: Gandhi, Sagar, et al.
Published: (2025)
Prompting without Panic: Attribute-aware, Zero-shot, Test-Time Calibration
by: Hebbalaguppe, Ramya, et al.
Published: (2025)
Learning Robust Representations via Bidirectional Transition for Visual Reinforcement Learning
by: Hu, Xiaobo, et al.
Published: (2023)
Can Reasons Help Improve Pedestrian Intent Estimation? A Cross-Modal Approach
by: Khindkar, Vaishnavi, et al.
Published: (2024)
MicroDetect-Net (MDN): Leveraging Deep Learning to Detect Microplastics in Clam Blood, a Step Towards Human Blood Analysis
by: Marwah, Riju, et al.
Published: (2025)
NeoARCADE: Robust Calibration for Distance Estimation to Support Assistive Drones for the Visually Impaired
by: Raj, Suman, et al.
Published: (2025)
Alignment and Safety of Diffusion Models via Reinforcement Learning and Reward Modeling: A Survey
by: Lamba, Preeti, et al.
Published: (2025)
Vision Mamba for Permeability Prediction of Porous Media
by: Kashefi, Ali, et al.
Published: (2025)
A novel Fourier neural operator framework for classification of multi-sized images: Application to three dimensional digital porous media
by: Kashefi, Ali, et al.
Published: (2024)
Attend what matters: Leveraging vision foundational models for breast cancer classification using mammograms
by: Sanghvi, Samyak, et al.
Published: (2026)
Robust Human Trajectory Prediction via Self-Supervised Skeleton Representation Learning
by: Arashima, Taishu, et al.
Published: (2026)
Image registration of 2D optical thin sections in a 3D porous medium: Application to a Berea sandstone digital rock image
by: Chung, Jaehong, et al.
Published: (2025)
An Exploratory Study on Abstract Images and Visual Representations Learned from Them
by: Li, Haotian, et al.
Published: (2025)
Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment
by: Zhao, Shijie, et al.
Published: (2025)
LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations
by: Alkhalefi, Mohammad, et al.
Published: (2024)
Robust Visual Representation Learning with Multi-modal Prior Knowledge for Image Classification Under Distribution Shift
by: Zhou, Hongkuan, et al.
Published: (2024)
V2M: Visual 2-Dimensional Mamba for Image Representation Learning
by: Wang, Chengkun, et al.
Published: (2024)
Improving Adversarial Robustness via Decoupled Visual Representation Masking
by: Liu, Decheng, et al.
Published: (2024)
Efficient High-Resolution Visual Representation Learning with State Space Model for Human Pose Estimation
by: Zhang, Hao, et al.
Published: (2024)
Through the PRISM: Principle-Aware, Interpretable, and Multi-Scale Evaluation of Visual Designs
by: Gandhi, Mona, et al.
Published: (2026)