:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Raj, Ankita, Prajaapat, Kaashika, Gandhi, Tapan Kumar, Arora, Chetan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.14360
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Backdoor Attacks on Open Vocabulary Object Detectors via Multi-Modal Prompt Tuning
by: Raj, Ankita, et al.
Published: (2025)

Identifying Physically Realizable Triggers for Backdoored Face Recognition Networks
by: Raj, Ankita, et al.
Published: (2025)

Examining the Threat Landscape: Foundation Models and Model Stealing
by: Raj, Ankita, et al.
Published: (2025)

Assessing Risk of Stealing Proprietary Models for Medical Imaging Tasks
by: Raj, Ankita, et al.
Published: (2025)

CrossMed: A Multimodal Cross-Task Benchmark for Compositional Generalization in Medical Imaging
by: Singh, Pooja, et al.
Published: (2025)

LQ-Adapter: ViT-Adapter with Learnable Queries for Gallbladder Cancer Detection from Ultrasound Image
by: Madan, Chetan, et al.
Published: (2024)

VGGT-SLAM++
by: Mandal, Avilasha, et al.
Published: (2026)

Focus on Texture: Rethinking Pre-training in Masked Autoencoders for Medical Image Classification
by: Madan, Chetan, et al.
Published: (2025)

Feature Space Perturbation: A Panacea to Enhanced Transferability Estimation
by: Khoba, Prafful Kumar, et al.
Published: (2025)

Reliable Active Learning from Unreliable Labels via Neural Collapse Geometry
by: Goel, Atharv, et al.
Published: (2025)

microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
by: Silva, Sathira, et al.
Published: (2025)

Learning Disentangled Representation in Object-Centric Models for Visual Dynamics Prediction via Transformers
by: Gandhi, Sanket, et al.
Published: (2024)

Use of Metric Learning for the Recognition of Handwritten Digits, and its Application to Increase the Outreach of Voice-based Communication Platforms
by: Pant, Devesh, et al.
Published: (2025)

FocusMAE: Gallbladder Cancer Detection from Ultrasound Videos with Focused Masked Autoencoders
by: Basu, Soumen, et al.
Published: (2024)

Mimicking-Bench: A Benchmark for Generalizable Humanoid-Scene Interaction Learning via Human Mimicking
by: Liu, Yun, et al.
Published: (2024)

Towards Fine-Grained Adaptation of CLIP via a Self-Trained Alignment Score
by: Ali, Eman, et al.
Published: (2025)

WavShadow: Wavelet Based Shadow Segmentation and Removal
by: Jain, Shreyans, et al.
Published: (2024)

Improving Image Captioning by Mimicking Human Reformulation Feedback at Inference-time
by: Berger, Uri, et al.
Published: (2025)

D-MASTER: Mask Annealed Transformer for Unsupervised Domain Adaptation in Breast Cancer Detection from Mammograms
by: Ashraf, Tajamul, et al.
Published: (2024)

ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth Estimation
by: Patni, Suraj, et al.
Published: (2024)

Personalized Image Generation from an Author Writing Style
by: Gandhi, Sagar, et al.
Published: (2025)

Prompting without Panic: Attribute-aware, Zero-shot, Test-Time Calibration
by: Hebbalaguppe, Ramya, et al.
Published: (2025)

Learning Robust Representations via Bidirectional Transition for Visual Reinforcement Learning
by: Hu, Xiaobo, et al.
Published: (2023)

Can Reasons Help Improve Pedestrian Intent Estimation? A Cross-Modal Approach
by: Khindkar, Vaishnavi, et al.
Published: (2024)

MicroDetect-Net (MDN): Leveraging Deep Learning to Detect Microplastics in Clam Blood, a Step Towards Human Blood Analysis
by: Marwah, Riju, et al.
Published: (2025)

NeoARCADE: Robust Calibration for Distance Estimation to Support Assistive Drones for the Visually Impaired
by: Raj, Suman, et al.
Published: (2025)

Alignment and Safety of Diffusion Models via Reinforcement Learning and Reward Modeling: A Survey
by: Lamba, Preeti, et al.
Published: (2025)

Vision Mamba for Permeability Prediction of Porous Media
by: Kashefi, Ali, et al.
Published: (2025)

A novel Fourier neural operator framework for classification of multi-sized images: Application to three dimensional digital porous media
by: Kashefi, Ali, et al.
Published: (2024)

Attend what matters: Leveraging vision foundational models for breast cancer classification using mammograms
by: Sanghvi, Samyak, et al.
Published: (2026)

Robust Human Trajectory Prediction via Self-Supervised Skeleton Representation Learning
by: Arashima, Taishu, et al.
Published: (2026)

Image registration of 2D optical thin sections in a 3D porous medium: Application to a Berea sandstone digital rock image
by: Chung, Jaehong, et al.
Published: (2025)

An Exploratory Study on Abstract Images and Visual Representations Learned from Them
by: Li, Haotian, et al.
Published: (2025)

Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment
by: Zhao, Shijie, et al.
Published: (2025)

LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations
by: Alkhalefi, Mohammad, et al.
Published: (2024)

Robust Visual Representation Learning with Multi-modal Prior Knowledge for Image Classification Under Distribution Shift
by: Zhou, Hongkuan, et al.
Published: (2024)

V2M: Visual 2-Dimensional Mamba for Image Representation Learning
by: Wang, Chengkun, et al.
Published: (2024)

Improving Adversarial Robustness via Decoupled Visual Representation Masking
by: Liu, Decheng, et al.
Published: (2024)

Efficient High-Resolution Visual Representation Learning with State Space Model for Human Pose Estimation
by: Zhang, Hao, et al.
Published: (2024)

Through the PRISM: Principle-Aware, Interpretable, and Multi-Scale Evaluation of Visual Designs
by: Gandhi, Mona, et al.
Published: (2026)