| Main Authors: | Loedeman, Jochem, Stol, Maarten C., Han, Tengda, Asano, Yuki M. |
|---|---|
| Format: | Preprint |
| Published: | 2022 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2210.06466 |
Similar Items
Learning to Count without Annotations
by: Knobel, Lukas, et al.
Published: (2023)
Self-Masking Networks for Unsupervised Adaptation
by: Warmerdam, Alfonso Taboada, et al.
Published: (2024)
No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations
by: Simoncini, Walter, et al.
Published: (2024)
It's Just Another Day: Unique Video Captioning by Discriminative Prompting
by: Perrett, Toby, et al.
Published: (2024)
Test-Time Modification: Inverse Domain Transformation for Robust Perception
by: Jadon, Arpit, et al.
Published: (2025)
GMOS: Grounding Moving Object Segmentation in 3D Space and Time
by: Xie, Junyu, et al.
Published: (2026)
In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model
by: Yin, Junhui, et al.
Published: (2024)
Stale Diffusion: Hyper-realistic 5D Movie Generation Using Old-school Methods
by: Henriques, Joao F., et al.
Published: (2024)
Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
by: Shukor, Mustafa, et al.
Published: (2024)
PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders
by: Cavagnero, Niccolò, et al.
Published: (2026)
CountGD: Multi-Modal Open-World Counting
by: Amini-Naieni, Niki, et al.
Published: (2024)
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
by: Li, Hongyu, et al.
Published: (2024)
SAMSelect: A Spectral Index Search for Marine Debris Visualization using Segment Anything
by: van Dalen, Joost, et al.
Published: (2025)
Prompt-based Adaptation in Large-scale Vision Models: A Survey
by: Xiao, Xi, et al.
Published: (2025)
Cascade Prompt Learning for Vision-Language Model Adaptation
by: Wu, Ge, et al.
Published: (2024)
Source-Free Domain Adaptation with Frozen Multimodal Foundation Model
by: Tang, Song, et al.
Published: (2023)
Beyond Grids: Exploring Elastic Input Sampling for Vision Transformers
by: Pardyl, Adam, et al.
Published: (2023)
LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
by: Jin, Can, et al.
Published: (2025)
Federated Learning with a Single Shared Image
by: Soni, Sunny, et al.
Published: (2024)
Unsupervised Parameter Efficient Source-free Post-pretraining
by: Jha, Abhishek, et al.
Published: (2025)
GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features
by: Sträter, Luc P. J., et al.
Published: (2024)
Evolving Prompt Adaptation for Vision-Language Models
by: Zhang, Enming, et al.
Published: (2026)
Character-Centric Understanding of Animated Movies
by: Gui, Zhongrui, et al.
Published: (2025)
Dynamic Reflections: Probing Video Representations with Text Alignment
by: Zhu, Tyler, et al.
Published: (2025)
Seeing without Pixels: Perception from Camera Trajectories
by: Xue, Zihui, et al.
Published: (2025)
Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution
by: Don, Marga, et al.
Published: (2024)
Little Data, Big Impact: Privacy-Aware Visual Language Models via Minimal Tuning
by: Samson, Laurens, et al.
Published: (2024)
CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks
by: Leiter, Christoph, et al.
Published: (2025)
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
by: Yao, Ting, et al.
Published: (2024)
Burst Image Super-Resolution via Multi-Cross Attention Encoding and Multi-Scan State-Space Decoding
by: Huang, Tengda, et al.
Published: (2025)
Segment Any 3D-Part in a Scene from a Sentence
by: Wu, Hongyu, et al.
Published: (2025)
PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
by: Dorkenwald, Michael, et al.
Published: (2024)
EA-ViT: Efficient Adaptation for Elastic Vision Transformer
by: Zhu, Chen, et al.
Published: (2025)
APLA: A Simple Adaptation Method for Vision Transformers
by: Sorkhei, Moein, et al.
Published: (2025)
Joint Post-Training Quantization of Vision Transformers with Learned Prompt-Guided Data Generation
by: Li, Shile, et al.
Published: (2026)
Patch Pruning Strategy Based on Robust Statistical Measures of Attention Weight Diversity in Vision Transformers
by: Igaue, Yuki, et al.
Published: (2025)
Learning Visual Prompts for Guiding the Attention of Vision Transformers
by: Rezaei, Razieh, et al.
Published: (2024)
Auto-Vocabulary Semantic Segmentation
by: Ülger, Osman, et al.
Published: (2023)
Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition
by: Hirakawa, Yuki, et al.
Published: (2025)
Prompt Estimation from Prototypes for Federated Prompt Tuning of Vision Transformers
by: Yashwanth, M, et al.
Published: (2025)