Saved in:
| Main Authors: | Shangguan, Zeyu, Seita, Daniel, Rostami, Mohammad |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.16469 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Cross-domain Multi-modal Few-shot Object Detection via Rich Text
by: Shangguan, Zeyu, et al.
Published: (2024)
by: Shangguan, Zeyu, et al.
Published: (2024)
FAD: Frequency Adaptation and Diversion for Cross-domain Few-shot Learning
by: Shi, Ruixiao, et al.
Published: (2025)
by: Shi, Ruixiao, et al.
Published: (2025)
Reviving In-domain Fine-tuning Methods for Source-Free Cross-domain Few-shot Learning
by: Zhao, Yaze, et al.
Published: (2026)
by: Zhao, Yaze, et al.
Published: (2026)
Mind the Discriminability Trap in Source-Free Cross-domain Few-shot Learning
by: Zhang, Zhenyu, et al.
Published: (2026)
by: Zhang, Zhenyu, et al.
Published: (2026)
Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition
by: Gan, Yaozong, et al.
Published: (2024)
by: Gan, Yaozong, et al.
Published: (2024)
Fusion-Mamba for Cross-modality Object Detection
by: Dong, Wenhao, et al.
Published: (2024)
by: Dong, Wenhao, et al.
Published: (2024)
Stability Plasticity Decoupled Fine-tuning For Few-shot end-to-end Object Detection
by: Yin, Yuantao, et al.
Published: (2024)
by: Yin, Yuantao, et al.
Published: (2024)
Adaptive Multi-prompt Contrastive Network for Few-shot Out-of-distribution Detection
by: Fang, Xiang, et al.
Published: (2025)
by: Fang, Xiang, et al.
Published: (2025)
Small Object Few-shot Segmentation for Vision-based Industrial Inspection
by: Zhang, Zilong, et al.
Published: (2024)
by: Zhang, Zilong, et al.
Published: (2024)
VVTRec: Radio Interferometric Reconstruction through Visual and Textual Modality Enrichment
by: Cheng, Kai, et al.
Published: (2026)
by: Cheng, Kai, et al.
Published: (2026)
NexViTAD: Few-shot Unsupervised Cross-Domain Defect Detection via Vision Foundation Models and Multi-Task Learning
by: Mu, Tianwei, et al.
Published: (2025)
by: Mu, Tianwei, et al.
Published: (2025)
Spectral Discrepancy and Cross-modal Semantic Consistency Learning for Object Detection in Hyperspectral Image
by: He, Xiao, et al.
Published: (2025)
by: He, Xiao, et al.
Published: (2025)
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion
by: Meng, Boyuan, et al.
Published: (2025)
by: Meng, Boyuan, et al.
Published: (2025)
NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results
by: Fu, Yuqian, et al.
Published: (2025)
by: Fu, Yuqian, et al.
Published: (2025)
Awesome Multi-modal Object Tracking
by: Zhang, Chunhui, et al.
Published: (2024)
by: Zhang, Chunhui, et al.
Published: (2024)
CluMo: Cluster-based Modality Fusion Prompt for Continual Learning in Visual Question Answering
by: Cai, Yuliang, et al.
Published: (2024)
by: Cai, Yuliang, et al.
Published: (2024)
The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results
by: Qiu, Xingyu, et al.
Published: (2026)
by: Qiu, Xingyu, et al.
Published: (2026)
IPFormer-VideoLLM: Enhancing Multi-modal Video Understanding for Multi-shot Scenes
by: Liang, Yujia, et al.
Published: (2025)
by: Liang, Yujia, et al.
Published: (2025)
Large Multi-modal Model Cartographic Map Comprehension for Textual Locality Georeferencing
by: Wijegunarathna, Kalana, et al.
Published: (2025)
by: Wijegunarathna, Kalana, et al.
Published: (2025)
Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images
by: Talaoubrid, Hicham, et al.
Published: (2025)
by: Talaoubrid, Hicham, et al.
Published: (2025)
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection
by: Pan, Jiancheng, et al.
Published: (2025)
by: Pan, Jiancheng, et al.
Published: (2025)
Hierarchical Multi-modal Transformer for Cross-modal Long Document Classification
by: Liu, Tengfei, et al.
Published: (2024)
by: Liu, Tengfei, et al.
Published: (2024)
GiPL: Generative augmented iterative Pseudo-Labeling for Cross-Domain Few-Shot Object Detection
by: Liu, Jiacong, et al.
Published: (2026)
by: Liu, Jiacong, et al.
Published: (2026)
MOCHA: Multi-modal Objects-aware Cross-arcHitecture Alignment
by: Camuffo, Elena, et al.
Published: (2025)
by: Camuffo, Elena, et al.
Published: (2025)
ViewSAM: Learning View-aware Cross-modal Semantics for Weakly Supervised Cross-view Referring Multi-Object Tracking
by: Ge, Jiawei, et al.
Published: (2026)
by: Ge, Jiawei, et al.
Published: (2026)
Cross-domain Multi-step Thinking: Zero-shot Fine-grained Traffic Sign Recognition in the Wild
by: Gan, Yaozong, et al.
Published: (2024)
by: Gan, Yaozong, et al.
Published: (2024)
Robust Domain Generalization for Multi-modal Object Recognition
by: Qiao, Yuxin, et al.
Published: (2024)
by: Qiao, Yuxin, et al.
Published: (2024)
Temporal Object-Aware Vision Transformer for Few-Shot Video Object Detection
by: Kumar, Yogesh, et al.
Published: (2025)
by: Kumar, Yogesh, et al.
Published: (2025)
Few-Shot LoRA Adaptation of a Flow-Matching Foundation Model for Cross-Spectral Object Detection
by: Clouser, Maxim, et al.
Published: (2026)
by: Clouser, Maxim, et al.
Published: (2026)
TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP
by: Cai, Yuliang, et al.
Published: (2025)
by: Cai, Yuliang, et al.
Published: (2025)
An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models
by: Hu, Zizhao, et al.
Published: (2024)
by: Hu, Zizhao, et al.
Published: (2024)
Active Multimodal Distillation for Few-shot Action Recognition
by: Feng, Weijia, et al.
Published: (2025)
by: Feng, Weijia, et al.
Published: (2025)
Reliable Few-shot Learning under Dual Noises
by: Zhang, Ji, et al.
Published: (2025)
by: Zhang, Ji, et al.
Published: (2025)
Few-shot Implicit Function Generation via Equivariance
by: Huang, Suizhi, et al.
Published: (2025)
by: Huang, Suizhi, et al.
Published: (2025)
Few-shot Semantic Encoding and Decoding for Video Surveillance
by: Cheng, Baoping, et al.
Published: (2025)
by: Cheng, Baoping, et al.
Published: (2025)
Siamese Transformer Networks for Few-shot Image Classification
by: Jiang, Weihao, et al.
Published: (2024)
by: Jiang, Weihao, et al.
Published: (2024)
LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models
by: Liao, Pan, et al.
Published: (2026)
by: Liao, Pan, et al.
Published: (2026)
On the Adversarial Robustness of Camera-based 3D Object Detection
by: Xie, Shaoyuan, et al.
Published: (2023)
by: Xie, Shaoyuan, et al.
Published: (2023)
Unsupervised Federated Domain Adaptation for Segmentation of MRI Images
by: Nananukul, Navapat, et al.
Published: (2024)
by: Nananukul, Navapat, et al.
Published: (2024)
Few-shot Writer Adaptation via Multimodal In-Context Learning
by: Simon, Tom, et al.
Published: (2026)
by: Simon, Tom, et al.
Published: (2026)
Similar Items
-
Cross-domain Multi-modal Few-shot Object Detection via Rich Text
by: Shangguan, Zeyu, et al.
Published: (2024) -
FAD: Frequency Adaptation and Diversion for Cross-domain Few-shot Learning
by: Shi, Ruixiao, et al.
Published: (2025) -
Reviving In-domain Fine-tuning Methods for Source-Free Cross-domain Few-shot Learning
by: Zhao, Yaze, et al.
Published: (2026) -
Mind the Discriminability Trap in Source-Free Cross-domain Few-shot Learning
by: Zhang, Zhenyu, et al.
Published: (2026) -
Cross-domain Few-shot In-context Learning for Enhancing Traffic Sign Recognition
by: Gan, Yaozong, et al.
Published: (2024)