:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Loedeman, Jochem, Stol, Maarten C., Han, Tengda, Asano, Yuki M.
Format:	Preprint
Published:	2022
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2210.06466
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Learning to Count without Annotations
by: Knobel, Lukas, et al.
Published: (2023)

Self-Masking Networks for Unsupervised Adaptation
by: Warmerdam, Alfonso Taboada, et al.
Published: (2024)

No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations
by: Simoncini, Walter, et al.
Published: (2024)

It's Just Another Day: Unique Video Captioning by Discriminative Prompting
by: Perrett, Toby, et al.
Published: (2024)

Test-Time Modification: Inverse Domain Transformation for Robust Perception
by: Jadon, Arpit, et al.
Published: (2025)

GMOS: Grounding Moving Object Segmentation in 3D Space and Time
by: Xie, Junyu, et al.
Published: (2026)

In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model
by: Yin, Junhui, et al.
Published: (2024)

Stale Diffusion: Hyper-realistic 5D Movie Generation Using Old-school Methods
by: Henriques, Joao F., et al.
Published: (2024)

Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
by: Shukor, Mustafa, et al.
Published: (2024)

PMT: Plain Mask Transformer for Image and Video Segmentation with Frozen Vision Encoders
by: Cavagnero, Niccolò, et al.
Published: (2026)

CountGD: Multi-Modal Open-World Counting
by: Amini-Naieni, Niki, et al.
Published: (2024)

Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
by: Li, Hongyu, et al.
Published: (2024)

SAMSelect: A Spectral Index Search for Marine Debris Visualization using Segment Anything
by: van Dalen, Joost, et al.
Published: (2025)

Prompt-based Adaptation in Large-scale Vision Models: A Survey
by: Xiao, Xi, et al.
Published: (2025)

Cascade Prompt Learning for Vision-Language Model Adaptation
by: Wu, Ge, et al.
Published: (2024)

Source-Free Domain Adaptation with Frozen Multimodal Foundation Model
by: Tang, Song, et al.
Published: (2023)

Beyond Grids: Exploring Elastic Input Sampling for Vision Transformers
by: Pardyl, Adam, et al.
Published: (2023)

LoR-VP: Low-Rank Visual Prompting for Efficient Vision Model Adaptation
by: Jin, Can, et al.
Published: (2025)

Federated Learning with a Single Shared Image
by: Soni, Sunny, et al.
Published: (2024)

Unsupervised Parameter Efficient Source-free Post-pretraining
by: Jha, Abhishek, et al.
Published: (2025)

GeneralAD: Anomaly Detection Across Domains by Attending to Distorted Features
by: Sträter, Luc P. J., et al.
Published: (2024)

Evolving Prompt Adaptation for Vision-Language Models
by: Zhang, Enming, et al.
Published: (2026)

Character-Centric Understanding of Animated Movies
by: Gui, Zhongrui, et al.
Published: (2025)

Dynamic Reflections: Probing Video Representations with Text Alignment
by: Zhu, Tyler, et al.
Published: (2025)

Seeing without Pixels: Perception from Camera Trajectories
by: Xue, Zihui, et al.
Published: (2025)

Foundation Model or Finetune? Evaluation of few-shot semantic segmentation for river pollution
by: Don, Marga, et al.
Published: (2024)

Little Data, Big Impact: Privacy-Aware Visual Language Models via Minimal Tuning
by: Samson, Laurens, et al.
Published: (2024)

CROC: Evaluating and Training T2I Metrics with Pseudo- and Human-Labeled Contrastive Robustness Checks
by: Leiter, Christoph, et al.
Published: (2025)

HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
by: Yao, Ting, et al.
Published: (2024)

Burst Image Super-Resolution via Multi-Cross Attention Encoding and Multi-Scan State-Space Decoding
by: Huang, Tengda, et al.
Published: (2025)

Segment Any 3D-Part in a Scene from a Sentence
by: Wu, Hongyu, et al.
Published: (2025)

PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs
by: Dorkenwald, Michael, et al.
Published: (2024)

EA-ViT: Efficient Adaptation for Elastic Vision Transformer
by: Zhu, Chen, et al.
Published: (2025)

APLA: A Simple Adaptation Method for Vision Transformers
by: Sorkhei, Moein, et al.
Published: (2025)

Joint Post-Training Quantization of Vision Transformers with Learned Prompt-Guided Data Generation
by: Li, Shile, et al.
Published: (2026)

Patch Pruning Strategy Based on Robust Statistical Measures of Attention Weight Diversity in Vision Transformers
by: Igaue, Yuki, et al.
Published: (2025)

Learning Visual Prompts for Guiding the Attention of Vision Transformers
by: Rezaei, Razieh, et al.
Published: (2024)

Auto-Vocabulary Semantic Segmentation
by: Ülger, Osman, et al.
Published: (2023)

Masked Language Prompting for Generative Data Augmentation in Few-shot Fashion Style Recognition
by: Hirakawa, Yuki, et al.
Published: (2025)

Prompt Estimation from Prototypes for Federated Prompt Tuning of Vision Transformers
by: Yashwanth, M, et al.
Published: (2025)