Saved in:
| Main Authors: | Wolf, Fabian, Tüselmann, Oliver, Matei, Arthur, Hennies, Lukas, Rass, Christoph, Fink, Gernot A. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.04214 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Self-Supervised Vision Transformers for Writer Retrieval
by: Raven, Tim, et al.
Published: (2024)
by: Raven, Tim, et al.
Published: (2024)
Exploring Architectures for CNN-Based Word Spotting
by: Rusakov, Eugen, et al.
Published: (2018)
by: Rusakov, Eugen, et al.
Published: (2018)
Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis
by: Abdullah, Ahmed, et al.
Published: (2024)
by: Abdullah, Ahmed, et al.
Published: (2024)
A Closer Look at the Few-Shot Adaptation of Large Vision-Language Models
by: Silva-Rodríguez, Julio, et al.
Published: (2023)
by: Silva-Rodríguez, Julio, et al.
Published: (2023)
Generative Compositor for Few-Shot Visual Information Extraction
by: Yang, Zhibo, et al.
Published: (2025)
by: Yang, Zhibo, et al.
Published: (2025)
Semi-Supervised Few-Shot Adaptation of Vision-Language Models
by: Silva-Rodríguez, Julio, et al.
Published: (2026)
by: Silva-Rodríguez, Julio, et al.
Published: (2026)
Low-Rank Few-Shot Adaptation of Vision-Language Models
by: Zanella, Maxime, et al.
Published: (2024)
by: Zanella, Maxime, et al.
Published: (2024)
Revisiting Few-Shot Object Detection with Vision-Language Models
by: Madan, Anish, et al.
Published: (2023)
by: Madan, Anish, et al.
Published: (2023)
Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation
by: Ding, Kun, et al.
Published: (2024)
by: Ding, Kun, et al.
Published: (2024)
Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features
by: Mitra, Chancharik, et al.
Published: (2024)
by: Mitra, Chancharik, et al.
Published: (2024)
Few-Shot Adaptation Benchmark for Remote Sensing Vision-Language Models
by: Khoury, Karim El, et al.
Published: (2025)
by: Khoury, Karim El, et al.
Published: (2025)
Auxiliary Descriptive Knowledge for Few-Shot Adaptation of Vision-Language Model
by: Lee, SuBeen, et al.
Published: (2025)
by: Lee, SuBeen, et al.
Published: (2025)
Efficient Few-Shot Continual Learning in Vision-Language Models
by: Panos, Aristeidis, et al.
Published: (2025)
by: Panos, Aristeidis, et al.
Published: (2025)
Benchmarking Vision-Language and Multimodal Large Language Models in Zero-shot and Few-shot Scenarios: A study on Christian Iconography
by: Spinaci, Gianmarco, et al.
Published: (2025)
by: Spinaci, Gianmarco, et al.
Published: (2025)
FSDAM: Few-Shot Driving Attention Modeling via Vision-Language Coupling
by: Hamid, Kaiser, et al.
Published: (2025)
by: Hamid, Kaiser, et al.
Published: (2025)
Few-Shot Image Quality Assessment via Adaptation of Vision-Language Models
by: Li, Xudong, et al.
Published: (2024)
by: Li, Xudong, et al.
Published: (2024)
Vision-Language In-Context Learning Driven Few-Shot Visual Inspection Model
by: Ueno, Shiryu, et al.
Published: (2025)
by: Ueno, Shiryu, et al.
Published: (2025)
Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models
by: Ali, Eman, et al.
Published: (2023)
by: Ali, Eman, et al.
Published: (2023)
ContextVLM: Zero-Shot and Few-Shot Context Understanding for Autonomous Driving using Vision Language Models
by: Sural, Shounak, et al.
Published: (2024)
by: Sural, Shounak, et al.
Published: (2024)
Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages
by: Farina, Matteo, et al.
Published: (2025)
by: Farina, Matteo, et al.
Published: (2025)
Cluster-Aware Prompt Ensemble Learning for Few-Shot Vision-Language Model Adaptation
by: Chen, Zhi, et al.
Published: (2025)
by: Chen, Zhi, et al.
Published: (2025)
Complementary Subspace Low-Rank Adaptation of Vision-Language Models for Few-Shot Classification
by: Wang, Zhongqi, et al.
Published: (2025)
by: Wang, Zhongqi, et al.
Published: (2025)
Pre-trained Vision and Language Transformers Are Few-Shot Incremental Learners
by: Park, Keon-Hee, et al.
Published: (2024)
by: Park, Keon-Hee, et al.
Published: (2024)
Few-Shot Adversarial Prompt Learning on Vision-Language Models
by: Zhou, Yiwei, et al.
Published: (2024)
by: Zhou, Yiwei, et al.
Published: (2024)
LLaFS: When Large Language Models Meet Few-Shot Segmentation
by: Zhu, Lanyun, et al.
Published: (2023)
by: Zhu, Lanyun, et al.
Published: (2023)
Enhancing Vision-Language Few-Shot Adaptation with Negative Learning
by: Zhang, Ce, et al.
Published: (2024)
by: Zhang, Ce, et al.
Published: (2024)
Efficient Few-Shot Learning in Remote Sensing: Fusing Vision and Vision-Language Models
by: Chua, Jia Yun, et al.
Published: (2025)
by: Chua, Jia Yun, et al.
Published: (2025)
Preserve and Sculpt: Manifold-Aligned Fine-tuning of Vision-Language Models for Few-Shot Learning
by: Chen, Dexia, et al.
Published: (2025)
by: Chen, Dexia, et al.
Published: (2025)
Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling
by: Wong, Bryan, et al.
Published: (2025)
by: Wong, Bryan, et al.
Published: (2025)
Few-Shot Image Classification and Segmentation as Visual Question Answering Using Vision-Language Models
by: Meng, Tian, et al.
Published: (2024)
by: Meng, Tian, et al.
Published: (2024)
Few-Shot Relation Extraction with Hybrid Visual Evidence
by: Gong, Jiaying, et al.
Published: (2024)
by: Gong, Jiaying, et al.
Published: (2024)
Proto-CLIP: Vision-Language Prototypical Network for Few-Shot Learning
by: P, Jishnu Jaykumar, et al.
Published: (2023)
by: P, Jishnu Jaykumar, et al.
Published: (2023)
ProKeR: A Kernel Perspective on Few-Shot Adaptation of Large Vision-Language Models
by: Bendou, Yassir, et al.
Published: (2025)
by: Bendou, Yassir, et al.
Published: (2025)
Mechanistic Finetuning of Vision-Language-Action Models via Few-Shot Demonstrations
by: Mitra, Chancharik, et al.
Published: (2025)
by: Mitra, Chancharik, et al.
Published: (2025)
DDFAV: Remote Sensing Large Vision Language Models Dataset and Evaluation Benchmark
by: Li, Haodong, et al.
Published: (2024)
by: Li, Haodong, et al.
Published: (2024)
A data-centric approach to class-specific bias in image data augmentation
by: Angelakis, Athanasios, et al.
Published: (2024)
by: Angelakis, Athanasios, et al.
Published: (2024)
Language-Aware Information Maximization for Transductive Few-Shot CLIP
by: Baklouti, Ghassen, et al.
Published: (2025)
by: Baklouti, Ghassen, et al.
Published: (2025)
Few-Shot Vision-Language Reasoning for Satellite Imagery via Verifiable Rewards
by: Koksal, Aybora, et al.
Published: (2025)
by: Koksal, Aybora, et al.
Published: (2025)
Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection
by: Fan, Yuanting, et al.
Published: (2025)
by: Fan, Yuanting, et al.
Published: (2025)
Towards Efficient and General-Purpose Few-Shot Misclassification Detection for Vision-Language Models
by: Zeng, Fanhu, et al.
Published: (2025)
by: Zeng, Fanhu, et al.
Published: (2025)
Similar Items
-
Self-Supervised Vision Transformers for Writer Retrieval
by: Raven, Tim, et al.
Published: (2024) -
Exploring Architectures for CNN-Based Word Spotting
by: Rusakov, Eugen, et al.
Published: (2018) -
Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis
by: Abdullah, Ahmed, et al.
Published: (2024) -
A Closer Look at the Few-Shot Adaptation of Large Vision-Language Models
by: Silva-Rodríguez, Julio, et al.
Published: (2023) -
Generative Compositor for Few-Shot Visual Information Extraction
by: Yang, Zhibo, et al.
Published: (2025)