:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shangguan, Zeyu, Seita, Daniel, Rostami, Mohammad
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.16469
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Cross-domain Multi-modal Few-shot Object Detection via Rich Text
by: Shangguan, Zeyu, et al.
Published: (2024)

FAD: Frequency Adaptation and Diversion for Cross-domain Few-shot Learning
by: Shi, Ruixiao, et al.
Published: (2025)

Reviving In-domain Fine-tuning Methods for Source-Free Cross-domain Few-shot Learning
by: Zhao, Yaze, et al.
Published: (2026)

Mind the Discriminability Trap in Source-Free Cross-domain Few-shot Learning
by: Zhang, Zhenyu, et al.
Published: (2026)

Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition
by: Gan, Yaozong, et al.
Published: (2024)

Fusion-Mamba for Cross-modality Object Detection
by: Dong, Wenhao, et al.
Published: (2024)

Stability Plasticity Decoupled Fine-tuning For Few-shot end-to-end Object Detection
by: Yin, Yuantao, et al.
Published: (2024)

Adaptive Multi-prompt Contrastive Network for Few-shot Out-of-distribution Detection
by: Fang, Xiang, et al.
Published: (2025)

Small Object Few-shot Segmentation for Vision-based Industrial Inspection
by: Zhang, Zilong, et al.
Published: (2024)

VVTRec: Radio Interferometric Reconstruction through Visual and Textual Modality Enrichment
by: Cheng, Kai, et al.
Published: (2026)

NexViTAD: Few-shot Unsupervised Cross-Domain Defect Detection via Vision Foundation Models and Multi-Task Learning
by: Mu, Tianwei, et al.
Published: (2025)

Spectral Discrepancy and Cross-modal Semantic Consistency Learning for Object Detection in Hyperspectral Image
by: He, Xiao, et al.
Published: (2025)

CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
by: Meng, Boyuan, et al.
Published: (2025)

NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results
by: Fu, Yuqian, et al.
Published: (2025)

Awesome Multi-modal Object Tracking
by: Zhang, Chunhui, et al.
Published: (2024)

CluMo: Cluster-based Modality Fusion Prompt for Continual Learning in Visual Question Answering
by: Cai, Yuliang, et al.
Published: (2024)

The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results
by: Qiu, Xingyu, et al.
Published: (2026)

IPFormer-VideoLLM: Enhancing Multi-modal Video Understanding for Multi-shot Scenes
by: Liang, Yujia, et al.
Published: (2025)

Large Multi-modal Model Cartographic Map Comprehension for Textual Locality Georeferencing
by: Wijegunarathna, Kalana, et al.
Published: (2025)

Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images
by: Talaoubrid, Hicham, et al.
Published: (2025)

Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection
by: Pan, Jiancheng, et al.
Published: (2025)

Hierarchical Multi-modal Transformer for Cross-modal Long Document Classification
by: Liu, Tengfei, et al.
Published: (2024)

GiPL: Generative augmented iterative Pseudo-Labeling for Cross-Domain Few-Shot Object Detection
by: Liu, Jiacong, et al.
Published: (2026)

MOCHA: Multi-modal Objects-aware Cross-arcHitecture Alignment
by: Camuffo, Elena, et al.
Published: (2025)

ViewSAM: Learning View-aware Cross-modal Semantics for Weakly Supervised Cross-view Referring Multi-Object Tracking
by: Ge, Jiawei, et al.
Published: (2026)

Cross-domain Multi-step Thinking: Zero-shot Fine-grained Traffic Sign Recognition in the Wild
by: Gan, Yaozong, et al.
Published: (2024)

Robust Domain Generalization for Multi-modal Object Recognition
by: Qiao, Yuxin, et al.
Published: (2024)

Temporal Object-Aware Vision Transformer for Few-Shot Video Object Detection
by: Kumar, Yogesh, et al.
Published: (2025)

Few-Shot LoRA Adaptation of a Flow-Matching Foundation Model for Cross-Spectral Object Detection
by: Clouser, Maxim, et al.
Published: (2026)

TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP
by: Cai, Yuliang, et al.
Published: (2025)

An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models
by: Hu, Zizhao, et al.
Published: (2024)

Active Multimodal Distillation for Few-shot Action Recognition
by: Feng, Weijia, et al.
Published: (2025)

Reliable Few-shot Learning under Dual Noises
by: Zhang, Ji, et al.
Published: (2025)

Few-shot Implicit Function Generation via Equivariance
by: Huang, Suizhi, et al.
Published: (2025)

Few-shot Semantic Encoding and Decoding for Video Surveillance
by: Cheng, Baoping, et al.
Published: (2025)

Siamese Transformer Networks for Few-shot Image Classification
by: Jiang, Weihao, et al.
Published: (2024)

LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models
by: Liao, Pan, et al.
Published: (2026)

On the Adversarial Robustness of Camera-based 3D Object Detection
by: Xie, Shaoyuan, et al.
Published: (2023)

Unsupervised Federated Domain Adaptation for Segmentation of MRI Images
by: Nananukul, Navapat, et al.
Published: (2024)

Few-shot Writer Adaptation via Multimodal In-Context Learning
by: Simon, Tom, et al.
Published: (2026)