Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Xiaomeng, Zhao, Zhengyu, Larson, Martha
Format:	Preprint
Published:	2025
Subjects:	Cryptography and Security
Online Access:	https://arxiv.org/abs/2502.08193
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929711860416512
author	Wang, Xiaomeng Zhao, Zhengyu Larson, Martha
author_facet	Wang, Xiaomeng Zhao, Zhengyu Larson, Martha
contents	Large Vision-Language Models (LVLMs) are susceptible to typographic attacks, which are misclassifications caused by an attack text that is added to an image. In this paper, we introduce a multi-image setting for studying typographic attacks, broadening the current emphasis of the literature on attacking individual images. Specifically, our focus is on attacking image sets without repeating the attack query. Such non-repeating attacks are stealthier, as they are more likely to evade a gatekeeper than attacks that repeat the same attack text. We introduce two attack strategies for the multi-image setting, leveraging the difficulty of the target image, the strength of the attack text, and text-image similarity. Our text-image similarity approach improves attack success rates by 21% over random, non-specific methods on the CLIP model using ImageNet while maintaining stealth in a multi-image scenario. An additional experiment demonstrates transferability, i.e., text-image similarity calculated using CLIP transfers when attacking InstructBLIP.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_08193
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Typographic Attacks in a Multi-Image Setting Wang, Xiaomeng Zhao, Zhengyu Larson, Martha Cryptography and Security Large Vision-Language Models (LVLMs) are susceptible to typographic attacks, which are misclassifications caused by an attack text that is added to an image. In this paper, we introduce a multi-image setting for studying typographic attacks, broadening the current emphasis of the literature on attacking individual images. Specifically, our focus is on attacking image sets without repeating the attack query. Such non-repeating attacks are stealthier, as they are more likely to evade a gatekeeper than attacks that repeat the same attack text. We introduce two attack strategies for the multi-image setting, leveraging the difficulty of the target image, the strength of the attack text, and text-image similarity. Our text-image similarity approach improves attack success rates by 21% over random, non-specific methods on the CLIP model using ImageNet while maintaining stealth in a multi-image scenario. An additional experiment demonstrates transferability, i.e., text-image similarity calculated using CLIP transfers when attacking InstructBLIP.
title	Typographic Attacks in a Multi-Image Setting
topic	Cryptography and Security
url	https://arxiv.org/abs/2502.08193

Similar Items