| Main Authors: | Peng, Xiaomeng, Huang, Xilang, Choi, Seon Han |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.17419 |
Similar Items
Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection
by: Li, Yuanze, et al.
Published: (2023)
Can Multimodal Large Language Models be Guided to Improve Industrial Anomaly Detection?
by: Chen, Zhiling, et al.
Published: (2025)
IAD-GPT: Advancing Visual Knowledge in Multimodal Large Language Model for Industrial Anomaly Detection
by: Li, Zewen, et al.
Published: (2025)
Tuned Reverse Distillation: Enhancing Multimodal Industrial Anomaly Detection with Crossmodal Tuners
by: Liu, Xinyue, et al.
Published: (2024)
Multimodal Industrial Anomaly Detection via Geometric Prior
by: Li, Min, et al.
Published: (2026)
MMAD: A Comprehensive Benchmark for Multimodal Large Language Models in Industrial Anomaly Detection
by: Jiang, Xi, et al.
Published: (2024)
EAGLE: Towards Efficient Arbitrary Referring Visual Prompts Comprehension for Multimodal Large Language Models
by: Zhang, Jiacheng, et al.
Published: (2024)
OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning
by: Zhao, Shifang, et al.
Published: (2025)
Breaking the Bias: Recalibrating the Attention of Industrial Anomaly Detection
by: Chen, Xin, et al.
Published: (2024)
Exploring Large Vision-Language Models for Robust and Efficient Industrial Anomaly Detection
by: Qian, Kun, et al.
Published: (2024)
TextGuider: Training-Free Guidance for Text Rendering via Attention Alignment
by: Baek, Kanghyun, et al.
Published: (2025)
Text-Guided Multimodal Unified Industrial Anomaly Detection
by: Li, Zewen, et al.
Published: (2026)
Multimodal Industrial Anomaly Detection by Crossmodal Feature Mapping
by: Costanzino, Alex, et al.
Published: (2023)
Venus: Benchmarking and Empowering Multimodal Large Language Models for Aesthetic Guidance and Cropping
by: Du, Tianxiang, et al.
Published: (2026)
Not All Regions Are Equal: Attention-Guided Perturbation Network for Industrial Anomaly Detection
by: Huang, Tingfeng, et al.
Published: (2024)
Detect, Classify, Act: Categorizing Industrial Anomalies with Multi-Modal Large Language Models
by: Mokhtar, Sassan, et al.
Published: (2025)
AgentIAD: Agentic Industrial Anomaly Detection via Adaptive Memory Augmentation
by: Miao, Junwen, et al.
Published: (2025)
Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models
by: Wang, Lehan, et al.
Published: (2025)
TWIST & SCOUT: Grounding Multimodal LLM-Experts by Forget-Free Tuning
by: Bhowmik, Aritra, et al.
Published: (2024)
Tuning-Free Image Customization with Image and Text Guidance
by: Li, Pengzhi, et al.
Published: (2024)
Incomplete Multimodal Industrial Anomaly Detection via Cross-Modal Distillation
by: Sui, Wenbo, et al.
Published: (2024)
LR-IAD: Mask-Free Industrial Anomaly Detection with Logical Reasoning
by: Zeng, Peijian, et al.
Published: (2025)
EAGLE: An Efficient Global Attention Lesion Segmentation Model for Hepatic Echinococcosis
by: Chen, Jiayan, et al.
Published: (2025)
WMoE-CLIP: Wavelet-Enhanced Mixture-of-Experts Prompt Learning for Zero-Shot Anomaly Detection
by: Chen, Peng, et al.
Published: (2026)
Text Prompt with Normality Guidance for Weakly Supervised Video Anomaly Detection
by: Yang, Zhiwei, et al.
Published: (2024)
Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models
by: Xu, Jiacong, et al.
Published: (2025)
VMAD: Visual-enhanced Multimodal Large Language Model for Zero-Shot Anomaly Detection
by: Deng, Huilin, et al.
Published: (2024)
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
by: Zhang, Wenqiao, et al.
Published: (2024)
A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Image Anomaly Detection
by: Lin, Yuxuan, et al.
Published: (2024)
Text-Guided Variational Image Generation for Industrial Anomaly Detection and Segmentation
by: Lee, Mingyu, et al.
Published: (2024)
PatchEAD: Unifying Industrial Visual Prompting Frameworks for Patch-Exclusive Anomaly Detection
by: Huang, Po-Han, et al.
Published: (2025)
Attention-driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models without Fine-Tuning
by: Xu, Hai-Ming, et al.
Published: (2024)
Seeing Is Believing? A Benchmark for Multimodal Large Language Models on Visual Illusions and Anomalies
by: Hou, Wenjin, et al.
Published: (2026)
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model
by: Jiang, Chaoya, et al.
Published: (2023)
Progressive Boundary Guided Anomaly Synthesis for Industrial Anomaly Detection
by: Chen, Qiyu, et al.
Published: (2024)
Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
by: Ahn, Daechul, et al.
Published: (2024)
ExpertGen: Training-Free Expert Guidance for Controllable Text-to-Face Generation
by: Shi, Liang, et al.
Published: (2025)
MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models
by: Shen, Leyang, et al.
Published: (2024)
BridgeNet: A Unified Multimodal Framework for Bridging 2D and 3D Industrial Anomaly Detection
by: Xiang, An, et al.
Published: (2025)
MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping
by: Huang, Yushi, et al.
Published: (2025)