:: Library Catalog

Bibliographic Details
Main Authors:	Chen, Dong; Hu, Zhengqing; Fan, Peiguang; Zhuang, Yueting; Li, Yafei; Liu, Qidong; Jiang, Xiaoheng; Xu, Mingliang
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.14880

Similar Items

GMFVAD: Using Grained Multi-modal Feature to Improve Video Anomaly Detection
by: Dai, Guangyu, et al.
Published: (2025)

Improving Large Models with Small models: Lower Costs and Better Performance
by: Chen, Dong, et al.
Published: (2024)

Few-Shot Object Detection with Sparse Context Transformers
by: Mei, Jie, et al.
Published: (2024)

Vision-Language Models Assisted Unsupervised Video Anomaly Detection
by: Jiang, Yalong, et al.
Published: (2024)

Topo-R1: Detecting Topological Anomalies via Vision-Language Models
by: Xu, Meilong, et al.
Published: (2026)

From Heads to Neurons: Causal Attribution and Steering in Multi-Task Vision-Language Models
by: Wang, Qidong, et al.
Published: (2026)

Exploring Large Vision-Language Models for Robust and Efficient Industrial Anomaly Detection
by: Qian, Kun, et al.
Published: (2024)

VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection
by: Zhong, Liangyu, et al.
Published: (2024)

Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection?
by: Chen, Zhiling, et al.
Published: (2025)

Chain-of-Anomaly Thoughts with Large Vision-Language Models
by: Domingos, Pedro, et al.
Published: (2025)

Anomaly-Aware Vision-Language Adapters for Zero-Shot Anomaly Detection
by: Aqeel, Muhammad, et al.
Published: (2026)

Robust Modality-incomplete Anomaly Detection: A Modality-instructive Framework with Benchmark
by: Miao, Bingchen, et al.
Published: (2024)

Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images
by: Song, Jinsol, et al.
Published: (2025)

Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection
by: Li, Yuanze, et al.
Published: (2023)

PatchGuard: Adversarially Robust Anomaly Detection and Localization through Vision Transformers and Pseudo Anomalies
by: Nafez, Mojtaba, et al.
Published: (2025)

HAAF: Hierarchical Adaptation and Alignment of Foundation Models for Few-Shot Pathology Anomaly Detection
by: Yang, Chunze, et al.
Published: (2026)

IAD-GPT: Advancing Visual Knowledge in Multimodal Large Language Model for Industrial Anomaly Detection
by: Li, Zewen, et al.
Published: (2025)

Towards Training-free Anomaly Detection with Vision and Language Foundation Models
by: Zhang, Jinjin, et al.
Published: (2025)

Anomaly Detection by Adapting a pre-trained Vision Language Model
by: Cai, Yuxuan, et al.
Published: (2024)

Harnessing Vision-Language Models for Time Series Anomaly Detection
by: He, Zelin, et al.
Published: (2025)

Video Anomaly Detection and Explanation via Large Language Models
by: Lv, Hui, et al.
Published: (2024)

Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection
by: Fan, Yuanting, et al.
Published: (2025)

Logic Distillation: Learning from Code Function by Function for Decision-making Tasks
by: Chen, Dong, et al.
Published: (2024)

Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
by: Meng, Chutian, et al.
Published: (2024)

FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model
by: Li, Yuanwei, et al.
Published: (2024)

Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models
by: Xu, Jiacong, et al.
Published: (2025)

One Language-Free Foundation Model Is Enough for Universal Vision Anomaly Detection
by: Gao, Bin-Bin, et al.
Published: (2026)

VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with Diffusion
by: Hicsonmez, Samet, et al.
Published: (2025)

Seeing Is Believing? A Benchmark for Multimodal Large Language Models on Visual Illusions and Anomalies
by: Hou, Wenjin, et al.
Published: (2026)

Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection
by: Li, Hanxi, et al.
Published: (2025)

Evaluation of Large Language Models for Anomaly Detection in Autonomous Vehicles
by: Loukas, Petros, et al.
Published: (2025)

Latent Anomaly Knowledge Excavation: Unveiling Sparse Sensitive Neurons in Vision-Language Models
by: Li, Shaotian, et al.
Published: (2026)

Harnessing Large Language Models for Training-free Video Anomaly Detection
by: Zanella, Luca, et al.
Published: (2024)

Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models
by: Yang, Yuchen, et al.
Published: (2024)

Aligning Effective Tokens with Video Anomaly in Large Language Models
by: Chen, Yingxian, et al.
Published: (2025)

LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions
by: Kwon, Yejin, et al.
Published: (2025)

ASBench: Image Anomalies Synthesis Benchmark for Anomaly Detection
by: Zhang, Qunyi, et al.
Published: (2025)

Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
by: Zong, Chang, et al.
Published: (2025)

Reasoning-Guided Grounding: Elevating Video Anomaly Detection through Multimodal Large Language Models
by: Agarwal, Sakshi, et al.
Published: (2026)

Distilling Aggregated Knowledge for Weakly-Supervised Video Anomaly Detection
by: Dalvi, Jash, et al.
Published: (2024)