:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Paranjape, Jay N., de Melo, Celso, Patel, Vishal M.
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2504.02801
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Mamba-based Siamese Network for Remote Sensing Change Detection
by: Paranjape, Jay N., et al.
Published: (2024)

Referring Change Detection in Remote Sensing Imagery
by: Korkmaz, Yilmaz, et al.
Published: (2025)

ViTA-Seg: Vision Transformer for Amodal Segmentation in Robotics
by: Caramia, Donato, et al.
Published: (2025)

Thermo-VL: Extending Vision-Language Models to Thermal Infrared Perception
by: Thushara, Rusiru, et al.
Published: (2026)

S-SAM: SVD-based Fine-Tuning of Segment Anything Model for Medical Image Segmentation
by: Paranjape, Jay N., et al.
Published: (2024)

ViTA-PAR: Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute Recognition
by: Park, Minjeong, et al.
Published: (2025)

Blackbox Adaptation for Medical Image Segmentation
by: Paranjape, Jay N., et al.
Published: (2024)

Federated Black-Box Adaptation for Semantic Segmentation
by: Paranjape, Jay N., et al.
Published: (2024)

GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2024)

Zero-Shot Scene Understanding for Automatic Target Recognition Using Large Vision-Language Models
by: Ranasinghe, Yasiru, et al.
Published: (2025)

FreeViS: Training-free Video Stylization with Inconsistent References
by: Xu, Jiacong, et al.
Published: (2025)

Thermal-Det: Language-Guided Cross-Modal Distillation for Open-Vocabulary Thermal Object Detection
by: Ranasinghe, Yasiru, et al.
Published: (2026)

Certainty and Uncertainty Guided Active Domain Adaptation
by: Safaei, Bardia, et al.
Published: (2025)

CGCE: Classifier-Guided Concept Erasure in Generative Models
by: Nguyen, Viet, et al.
Published: (2025)

ViLCo-Bench: VIdeo Language COntinual learning Benchmark
by: Tang, Tianqi, et al.
Published: (2024)

I2I-Galip: Unsupervised Medical Image Translation Using Generative Adversarial CLIP
by: Korkmaz, Yilmaz, et al.
Published: (2024)

Silhouette-based Gait Foundation Model
by: Ye, Dingqiang, et al.
Published: (2025)

Active Learning for Vision-Language Models
by: Safaei, Bardia, et al.
Published: (2024)

ModelMix: A New Model-Mixup Strategy to Minimize Vicinal Risk across Tasks for Few-scribble based Cardiac Segmentation
by: Zhang, Ke, et al.
Published: (2024)

ViLReF: An Expert Knowledge Enabled Vision-Language Retinal Foundation Model
by: Yang, Shengzhu, et al.
Published: (2024)

RemoteVAR: Autoregressive Visual Modeling for Remote Sensing Change Detection
by: Korkmaz, Yilmaz, et al.
Published: (2026)

Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling
by: Bandara, Wele Gedara Chaminda, et al.
Published: (2024)

Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions
by: Xu, Jiacong, et al.
Published: (2024)

CanViT: Toward Active-Vision Foundation Models
by: Berreby, Yohaï-Eliel, et al.
Published: (2026)

MedCL: Learning Consistent Anatomy Distribution for Scribble-supervised Medical Image Segmentation
by: Zhang, Ke, et al.
Published: (2025)

AWRaCLe: All-Weather Image Restoration using Visual In-Context Learning
by: Rajagopalan, Sudarshan, et al.
Published: (2024)

Not All Tokens Need 40 Steps: Heterogeneous Step Allocation in Diffusion Transformers for Efficient Video Generation
by: Chu, Ernie, et al.
Published: (2026)

Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation
by: Rajagopalan, Sudarshan, et al.
Published: (2024)

Hyp-OC: Hyperbolic One Class Classification for Face Anti-Spoofing
by: Narayan, Kartik, et al.
Published: (2024)

Implicit Neural Representations: A Signal Processing Perspective
by: Jayasundara, Dhananjaya, et al.
Published: (2026)

Your Pre-trained Diffusion Model Secretly Knows Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2026)

Face-to-Face: A Video Dataset for Multi-Person Interaction Modeling
by: Chu, Ernie, et al.
Published: (2026)

Frame by Familiar Frame: Understanding Replication in Video Diffusion Models
by: Rahman, Aimon, et al.
Published: (2024)

Latent Feature-Guided Diffusion Models for Shadow Removal
by: Mei, Kangfu, et al.
Published: (2023)

SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
by: Ma, Wufei, et al.
Published: (2025)

Dreamguider: Improved Training free Diffusion-based Conditional Generation
by: Nair, Nithin Gopalakrishnan, et al.
Published: (2024)

MambaRecon: MRI Reconstruction with Structured State Space Models
by: Korkmaz, Yilmaz, et al.
Published: (2024)

Diagnosis of diabetic retinopathy using machine learning & deep learning technique
by: Shah, Eric, et al.
Published: (2024)

UNIV: Unified Foundation Model for Infrared and Visible Modalities
by: Mao, Fangyuan, et al.
Published: (2025)

FaceXBench: Evaluating Multimodal LLMs on Face Understanding
by: Narayan, Kartik, et al.
Published: (2025)