Saved in:
| Main Authors: | Chen, Dong, Hu, Zhengqing, Fan, Peiguang, Zhuang, Yueting, Li, Yafei, Liu, Qidong, Jiang, Xiaoheng, Xu, Mingliang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.14880 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
GMFVAD: Using Grained Multi-modal Feature to Improve Video Anomaly Detection
by: Dai, Guangyu, et al.
Published: (2025)
by: Dai, Guangyu, et al.
Published: (2025)
Improving Large Models with Small models: Lower Costs and Better Performance
by: Chen, Dong, et al.
Published: (2024)
by: Chen, Dong, et al.
Published: (2024)
Few-Shot Object Detection with Sparse Context Transformers
by: Mei, Jie, et al.
Published: (2024)
by: Mei, Jie, et al.
Published: (2024)
Vision-Language Models Assisted Unsupervised Video Anomaly Detection
by: Jiang, Yalong, et al.
Published: (2024)
by: Jiang, Yalong, et al.
Published: (2024)
Topo-R1: Detecting Topological Anomalies via Vision-Language Models
by: Xu, Meilong, et al.
Published: (2026)
by: Xu, Meilong, et al.
Published: (2026)
From Heads to Neurons: Causal Attribution and Steering in Multi-Task Vision-Language Models
by: Wang, Qidong, et al.
Published: (2026)
by: Wang, Qidong, et al.
Published: (2026)
Exploring Large Vision-Language Models for Robust and Efficient Industrial Anomaly Detection
by: Qian, Kun, et al.
Published: (2024)
by: Qian, Kun, et al.
Published: (2024)
VL4AD: Vision-Language Models Improve Pixel-wise Anomaly Detection
by: Zhong, Liangyu, et al.
Published: (2024)
by: Zhong, Liangyu, et al.
Published: (2024)
Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection?
by: Chen, Zhiling, et al.
Published: (2025)
by: Chen, Zhiling, et al.
Published: (2025)
Chain-of-Anomaly Thoughts with Large Vision-Language Models
by: Domingos, Pedro, et al.
Published: (2025)
by: Domingos, Pedro, et al.
Published: (2025)
Anomaly-Aware Vision-Language Adapters for Zero-Shot Anomaly Detection
by: Aqeel, Muhammad, et al.
Published: (2026)
by: Aqeel, Muhammad, et al.
Published: (2026)
Robust Modality-incomplete Anomaly Detection: A Modality-instructive Framework with Benchmark
by: Miao, Bingchen, et al.
Published: (2024)
by: Miao, Bingchen, et al.
Published: (2024)
Normal and Abnormal Pathology Knowledge-Augmented Vision-Language Model for Anomaly Detection in Pathology Images
by: Song, Jinsol, et al.
Published: (2025)
by: Song, Jinsol, et al.
Published: (2025)
Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection
by: Li, Yuanze, et al.
Published: (2023)
by: Li, Yuanze, et al.
Published: (2023)
PatchGuard: Adversarially Robust Anomaly Detection and Localization through Vision Transformers and Pseudo Anomalies
by: Nafez, Mojtaba, et al.
Published: (2025)
by: Nafez, Mojtaba, et al.
Published: (2025)
HAAF: Hierarchical Adaptation and Alignment of Foundation Models for Few-Shot Pathology Anomaly Detection
by: Yang, Chunze, et al.
Published: (2026)
by: Yang, Chunze, et al.
Published: (2026)
IAD-GPT: Advancing Visual Knowledge in Multimodal Large Language Model for Industrial Anomaly Detection
by: Li, Zewen, et al.
Published: (2025)
by: Li, Zewen, et al.
Published: (2025)
Towards Training-free Anomaly Detection with Vision and Language Foundation Models
by: Zhang, Jinjin, et al.
Published: (2025)
by: Zhang, Jinjin, et al.
Published: (2025)
Anomaly Detection by Adapting a pre-trained Vision Language Model
by: Cai, Yuxuan, et al.
Published: (2024)
by: Cai, Yuxuan, et al.
Published: (2024)
Harnessing Vision-Language Models for Time Series Anomaly Detection
by: He, Zelin, et al.
Published: (2025)
by: He, Zelin, et al.
Published: (2025)
Video Anomaly Detection and Explanation via Large Language Models
by: Lv, Hui, et al.
Published: (2024)
by: Lv, Hui, et al.
Published: (2024)
Towards Fine-Grained Vision-Language Alignment for Few-Shot Anomaly Detection
by: Fan, Yuanting, et al.
Published: (2025)
by: Fan, Yuanting, et al.
Published: (2025)
Logic Distillation: Learning from Code Function by Function for Decision-making Tasks
by: Chen, Dong, et al.
Published: (2024)
by: Chen, Dong, et al.
Published: (2024)
Image Regeneration: Evaluating Text-to-Image Model via Generating Identical Image with Multimodal Large Language Models
by: Meng, Chutian, et al.
Published: (2024)
by: Meng, Chutian, et al.
Published: (2024)
FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model
by: Li, Yuanwei, et al.
Published: (2024)
by: Li, Yuanwei, et al.
Published: (2024)
Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models
by: Xu, Jiacong, et al.
Published: (2025)
by: Xu, Jiacong, et al.
Published: (2025)
One Language-Free Foundation Model Is Enough for Universal Vision Anomaly Detection
by: Gao, Bin-Bin, et al.
Published: (2026)
by: Gao, Bin-Bin, et al.
Published: (2026)
VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with Diffusion
by: Hicsonmez, Samet, et al.
Published: (2025)
by: Hicsonmez, Samet, et al.
Published: (2025)
Seeing Is Believing? A Benchmark for Multimodal Large Language Models on Visual Illusions and Anomalies
by: Hou, Wenjin, et al.
Published: (2026)
by: Hou, Wenjin, et al.
Published: (2026)
Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection
by: Li, Hanxi, et al.
Published: (2025)
by: Li, Hanxi, et al.
Published: (2025)
Evaluation of Large Language Models for Anomaly Detection in Autonomous Vehicles
by: Loukas, Petros, et al.
Published: (2025)
by: Loukas, Petros, et al.
Published: (2025)
Latent Anomaly Knowledge Excavation: Unveiling Sparse Sensitive Neurons in Vision-Language Models
by: Li, Shaotian, et al.
Published: (2026)
by: Li, Shaotian, et al.
Published: (2026)
Harnessing Large Language Models for Training-free Video Anomaly Detection
by: Zanella, Luca, et al.
Published: (2024)
by: Zanella, Luca, et al.
Published: (2024)
Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models
by: Yang, Yuchen, et al.
Published: (2024)
by: Yang, Yuchen, et al.
Published: (2024)
Aligning Effective Tokens with Video Anomaly in Large Language Models
by: Chen, Yingxian, et al.
Published: (2025)
by: Chen, Yingxian, et al.
Published: (2025)
LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions
by: Kwon, Yejin, et al.
Published: (2025)
by: Kwon, Yejin, et al.
Published: (2025)
ASBench: Image Anomalies Synthesis Benchmark for Anomaly Detection
by: Zhang, Qunyi, et al.
Published: (2025)
by: Zhang, Qunyi, et al.
Published: (2025)
Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer
by: Zong, Chang, et al.
Published: (2025)
by: Zong, Chang, et al.
Published: (2025)
Reasoning-Guided Grounding: Elevating Video Anomaly Detection through Multimodal Large Language Models
by: Agarwal, Sakshi, et al.
Published: (2026)
by: Agarwal, Sakshi, et al.
Published: (2026)
Distilling Aggregated Knowledge for Weakly-Supervised Video Anomaly Detection
by: Dalvi, Jash, et al.
Published: (2024)
by: Dalvi, Jash, et al.
Published: (2024)
Similar Items
-
GMFVAD: Using Grained Multi-modal Feature to Improve Video Anomaly Detection
by: Dai, Guangyu, et al.
Published: (2025) -
Improving Large Models with Small models: Lower Costs and Better Performance
by: Chen, Dong, et al.
Published: (2024) -
Few-Shot Object Detection with Sparse Context Transformers
by: Mei, Jie, et al.
Published: (2024) -
Vision-Language Models Assisted Unsupervised Video Anomaly Detection
by: Jiang, Yalong, et al.
Published: (2024) -
Topo-R1: Detecting Topological Anomalies via Vision-Language Models
by: Xu, Meilong, et al.
Published: (2026)