Saved in:
| Main Authors: | Paranjape, Jay N., de Melo, Celso, Patel, Vishal M. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.02801 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Mamba-based Siamese Network for Remote Sensing Change Detection
by: Paranjape, Jay N., et al.
Published: (2024)
by: Paranjape, Jay N., et al.
Published: (2024)
Referring Change Detection in Remote Sensing Imagery
by: Korkmaz, Yilmaz, et al.
Published: (2025)
by: Korkmaz, Yilmaz, et al.
Published: (2025)
ViTA-Seg: Vision Transformer for Amodal Segmentation in Robotics
by: Caramia, Donato, et al.
Published: (2025)
by: Caramia, Donato, et al.
Published: (2025)
Thermo-VL: Extending Vision-Language Models to Thermal Infrared Perception
by: Thushara, Rusiru, et al.
Published: (2026)
by: Thushara, Rusiru, et al.
Published: (2026)
S-SAM: SVD-based Fine-Tuning of Segment Anything Model for Medical Image Segmentation
by: Paranjape, Jay N., et al.
Published: (2024)
by: Paranjape, Jay N., et al.
Published: (2024)
ViTA-PAR: Visual and Textual Attribute Alignment with Attribute Prompting for Pedestrian Attribute Recognition
by: Park, Minjeong, et al.
Published: (2025)
by: Park, Minjeong, et al.
Published: (2025)
Blackbox Adaptation for Medical Image Segmentation
by: Paranjape, Jay N., et al.
Published: (2024)
by: Paranjape, Jay N., et al.
Published: (2024)
Federated Black-Box Adaptation for Semantic Segmentation
by: Paranjape, Jay N., et al.
Published: (2024)
by: Paranjape, Jay N., et al.
Published: (2024)
GenDeg: Diffusion-based Degradation Synthesis for Generalizable All-In-One Image Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2024)
by: Rajagopalan, Sudarshan, et al.
Published: (2024)
Zero-Shot Scene Understanding for Automatic Target Recognition Using Large Vision-Language Models
by: Ranasinghe, Yasiru, et al.
Published: (2025)
by: Ranasinghe, Yasiru, et al.
Published: (2025)
FreeViS: Training-free Video Stylization with Inconsistent References
by: Xu, Jiacong, et al.
Published: (2025)
by: Xu, Jiacong, et al.
Published: (2025)
Thermal-Det: Language-Guided Cross-Modal Distillation for Open-Vocabulary Thermal Object Detection
by: Ranasinghe, Yasiru, et al.
Published: (2026)
by: Ranasinghe, Yasiru, et al.
Published: (2026)
Certainty and Uncertainty Guided Active Domain Adaptation
by: Safaei, Bardia, et al.
Published: (2025)
by: Safaei, Bardia, et al.
Published: (2025)
CGCE: Classifier-Guided Concept Erasure in Generative Models
by: Nguyen, Viet, et al.
Published: (2025)
by: Nguyen, Viet, et al.
Published: (2025)
ViLCo-Bench: VIdeo Language COntinual learning Benchmark
by: Tang, Tianqi, et al.
Published: (2024)
by: Tang, Tianqi, et al.
Published: (2024)
I2I-Galip: Unsupervised Medical Image Translation Using Generative Adversarial CLIP
by: Korkmaz, Yilmaz, et al.
Published: (2024)
by: Korkmaz, Yilmaz, et al.
Published: (2024)
Silhouette-based Gait Foundation Model
by: Ye, Dingqiang, et al.
Published: (2025)
by: Ye, Dingqiang, et al.
Published: (2025)
Active Learning for Vision-Language Models
by: Safaei, Bardia, et al.
Published: (2024)
by: Safaei, Bardia, et al.
Published: (2024)
ModelMix: A New Model-Mixup Strategy to Minimize Vicinal Risk across Tasks for Few-scribble based Cardiac Segmentation
by: Zhang, Ke, et al.
Published: (2024)
by: Zhang, Ke, et al.
Published: (2024)
ViLReF: An Expert Knowledge Enabled Vision-Language Retinal Foundation Model
by: Yang, Shengzhu, et al.
Published: (2024)
by: Yang, Shengzhu, et al.
Published: (2024)
RemoteVAR: Autoregressive Visual Modeling for Remote Sensing Change Detection
by: Korkmaz, Yilmaz, et al.
Published: (2026)
by: Korkmaz, Yilmaz, et al.
Published: (2026)
Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling
by: Bandara, Wele Gedara Chaminda, et al.
Published: (2024)
by: Bandara, Wele Gedara Chaminda, et al.
Published: (2024)
Leveraging Thermal Modality to Enhance Reconstruction in Low-Light Conditions
by: Xu, Jiacong, et al.
Published: (2024)
by: Xu, Jiacong, et al.
Published: (2024)
CanViT: Toward Active-Vision Foundation Models
by: Berreby, Yohaï-Eliel, et al.
Published: (2026)
by: Berreby, Yohaï-Eliel, et al.
Published: (2026)
MedCL: Learning Consistent Anatomy Distribution for Scribble-supervised Medical Image Segmentation
by: Zhang, Ke, et al.
Published: (2025)
by: Zhang, Ke, et al.
Published: (2025)
AWRaCLe: All-Weather Image Restoration using Visual In-Context Learning
by: Rajagopalan, Sudarshan, et al.
Published: (2024)
by: Rajagopalan, Sudarshan, et al.
Published: (2024)
Not All Tokens Need 40 Steps: Heterogeneous Step Allocation in Diffusion Transformers for Efficient Video Generation
by: Chu, Ernie, et al.
Published: (2026)
by: Chu, Ernie, et al.
Published: (2026)
Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation
by: Rajagopalan, Sudarshan, et al.
Published: (2024)
by: Rajagopalan, Sudarshan, et al.
Published: (2024)
Hyp-OC: Hyperbolic One Class Classification for Face Anti-Spoofing
by: Narayan, Kartik, et al.
Published: (2024)
by: Narayan, Kartik, et al.
Published: (2024)
Implicit Neural Representations: A Signal Processing Perspective
by: Jayasundara, Dhananjaya, et al.
Published: (2026)
by: Jayasundara, Dhananjaya, et al.
Published: (2026)
Your Pre-trained Diffusion Model Secretly Knows Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2026)
by: Rajagopalan, Sudarshan, et al.
Published: (2026)
Face-to-Face: A Video Dataset for Multi-Person Interaction Modeling
by: Chu, Ernie, et al.
Published: (2026)
by: Chu, Ernie, et al.
Published: (2026)
Frame by Familiar Frame: Understanding Replication in Video Diffusion Models
by: Rahman, Aimon, et al.
Published: (2024)
by: Rahman, Aimon, et al.
Published: (2024)
Latent Feature-Guided Diffusion Models for Shadow Removal
by: Mei, Kangfu, et al.
Published: (2023)
by: Mei, Kangfu, et al.
Published: (2023)
SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models
by: Ma, Wufei, et al.
Published: (2025)
by: Ma, Wufei, et al.
Published: (2025)
Dreamguider: Improved Training free Diffusion-based Conditional Generation
by: Nair, Nithin Gopalakrishnan, et al.
Published: (2024)
by: Nair, Nithin Gopalakrishnan, et al.
Published: (2024)
MambaRecon: MRI Reconstruction with Structured State Space Models
by: Korkmaz, Yilmaz, et al.
Published: (2024)
by: Korkmaz, Yilmaz, et al.
Published: (2024)
Diagnosis of diabetic retinopathy using machine learning & deep learning technique
by: Shah, Eric, et al.
Published: (2024)
by: Shah, Eric, et al.
Published: (2024)
UNIV: Unified Foundation Model for Infrared and Visible Modalities
by: Mao, Fangyuan, et al.
Published: (2025)
by: Mao, Fangyuan, et al.
Published: (2025)
FaceXBench: Evaluating Multimodal LLMs on Face Understanding
by: Narayan, Kartik, et al.
Published: (2025)
by: Narayan, Kartik, et al.
Published: (2025)
Similar Items
-
A Mamba-based Siamese Network for Remote Sensing Change Detection
by: Paranjape, Jay N., et al.
Published: (2024) -
Referring Change Detection in Remote Sensing Imagery
by: Korkmaz, Yilmaz, et al.
Published: (2025) -
ViTA-Seg: Vision Transformer for Amodal Segmentation in Robotics
by: Caramia, Donato, et al.
Published: (2025) -
Thermo-VL: Extending Vision-Language Models to Thermal Infrared Perception
by: Thushara, Rusiru, et al.
Published: (2026) -
S-SAM: SVD-based Fine-Tuning of Segment Anything Model for Medical Image Segmentation
by: Paranjape, Jay N., et al.
Published: (2024)