:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Zhiwei, Pang, Yitian, Wang, Weining, Sun, Zhenan, Li, Qi
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2512.16523
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

AGC: Adaptive Geodesic Correction for Adversarial Robustness on Vision-Language Models
by: Li, Zhiwei, et al.
Published: (2026)

Towards High Fidelity Face Swapping: A Comprehensive Survey and New Benchmark
by: Li, Qi, et al.
Published: (2026)

Enhancing Adversarial Robustness of Vision-Language Models through Low-Rank Adaptation
by: Ji, Yuheng, et al.
Published: (2024)

Frustratingly Easy Test-Time Adaptation of Vision-Language Models
by: Farina, Matteo, et al.
Published: (2024)

ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models
by: Luo, Wei, et al.
Published: (2026)

StableTTA: Improving Vision Model Performance by Training-free Test-Time Adaptation Methods
by: Li, Zheng, et al.
Published: (2026)

Evolving Prompt Adaptation for Vision-Language Models
by: Zhang, Enming, et al.
Published: (2026)

Subspace Alignment for Vision-Language Model Test-time Adaptation
by: Zeng, Zhichen, et al.
Published: (2026)

DOTA: Distributional Test-Time Adaptation of Vision-Language Models
by: Han, Zongbo, et al.
Published: (2024)

Navigating the Trade-off: A Synthesis of Defensive Strategies for Zero-Shot Adversarial Robustness in Vision-Language Models
by: Xu, Zane, et al.
Published: (2025)

QAPruner: Quantization-Aware Vision Token Pruning for Multimodal Large Language Models
by: Wang, Xinhao, et al.
Published: (2026)

On the Adversarial Robustness of Large Vision-Language Models under Visual Token Compression
by: Zhang, Xinwei, et al.
Published: (2026)

Doubly Debiased Test-Time Prompt Tuning for Vision-Language Models
by: Song, Fei, et al.
Published: (2025)

Robust SAM: On the Adversarial Robustness of Vision Foundation Models
by: Long, Jiahuan, et al.
Published: (2025)

Probing the Robustness of Vision-Language Pretrained Models: A Multimodal Adversarial Attack Approach
by: Guan, Jiwei, et al.
Published: (2024)

PIP: Detecting Adversarial Examples in Large Vision-Language Models via Attention Patterns of Irrelevant Probe Questions
by: Zhang, Yudong, et al.
Published: (2024)

NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models
by: Zhang, Jiaming, et al.
Published: (2025)

Spurious Feature Eraser: Stabilizing Test-Time Adaptation for Vision-Language Foundation Model
by: Ma, Huan, et al.
Published: (2024)

Adversarial Prompt Tuning for Vision-Language Models
by: Zhang, Jiaming, et al.
Published: (2023)

A Hybrid Defense Strategy for Boosting Adversarial Robustness in Vision-Language Models
by: Liang, Yuhan, et al.
Published: (2024)

MoPE-CLIP: Structured Pruning for Efficient Vision-Language Models with Module-wise Pruning Error Metric
by: Lin, Haokun, et al.
Published: (2024)

Multi-Cache Enhanced Prototype Learning for Test-Time Generalization of Vision-Language Models
by: Chen, Xinyu, et al.
Published: (2025)

Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning
by: Li, Jiajie, et al.
Published: (2026)

Adversarial Prompt Distillation for Vision-Language Models
by: Luo, Lin, et al.
Published: (2024)

Explainable Adversarial-Robust Vision-Language-Action Model for Robotic Manipulation
by: Kim, Ju-Young, et al.
Published: (2025)

On the Adversarial Robustness of Camera-based 3D Object Detection
by: Xie, Shaoyuan, et al.
Published: (2023)

Fewer Tokens and Fewer Videos: Extending Video Understanding Abilities in Large Vision-Language Models
by: Chen, Shimin, et al.
Published: (2024)

AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization
by: Liu, Chaohu, et al.
Published: (2025)

Securing Vision-Language Models with a Robust Encoder Against Jailbreak and Adversarial Attacks
by: Hossain, Md Zarif, et al.
Published: (2024)

Learning Knowledge-based Prompts for Robust 3D Mask Presentation Attack Detection
by: Jiang, Fangling, et al.
Published: (2025)

Group Orthogonalization Regularization For Vision Models Adaptation and Robustness
by: Kurtz, Yoav, et al.
Published: (2023)

One Prompt Word is Enough to Boost Adversarial Robustness for Pre-trained Vision-Language Models
by: Li, Lin, et al.
Published: (2024)

Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
by: Zhang, Chengsheng, et al.
Published: (2026)

Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models
by: Schlarmann, Christian, et al.
Published: (2024)

Adversarial Defense in Vision-Language Models: An Overview
by: Fu, Xiaowei, et al.
Published: (2026)

General Scene Adaptation for Vision-and-Language Navigation
by: Hong, Haodong, et al.
Published: (2025)

Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models
by: Zhang, Naifu, et al.
Published: (2025)

Understanding Retrieval-Augmented Task Adaptation for Vision-Language Models
by: Ming, Yifei, et al.
Published: (2024)

LoopVLA: Learning Sufficiency in Recurrent Refinement for Vision-Language-Action Models
by: Shen, Boyang, et al.
Published: (2026)

On Inherent Adversarial Robustness of Active Vision Systems
by: Mukherjee, Amitangshu, et al.
Published: (2024)