:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Deng, Huilin, Luo, Hongchen, Zhai, Wei, Cao, Yang, Kang, Yu
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Computer Vision and Pattern Recognition
Accesso online:	https://arxiv.org/abs/2409.20146
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning
di: Deng, Huilin, et al.
Pubblicazione: (2025)

Bidirectional Progressive Transformer for Interaction Intention Anticipation
di: Zhang, Zichen, et al.
Pubblicazione: (2024)

PEAR: Phrase-Based Hand-Object Interaction Anticipation
di: Zhang, Zichen, et al.
Pubblicazione: (2024)

Towards Zero-Shot Anomaly Detection and Reasoning with Multimodal Large Language Models
di: Xu, Jiacong, et al.
Pubblicazione: (2025)

Visual-Geometric Collaborative Guidance for Affordance Learning
di: Luo, Hongchen, et al.
Pubblicazione: (2024)

Intention-driven Ego-to-Exo Video Generation
di: Luo, Hongchen, et al.
Pubblicazione: (2024)

Global-Regularized Neighborhood Regression for Efficient Zero-Shot Texture Anomaly Detection
di: Yao, Haiming, et al.
Pubblicazione: (2024)

CoPS: Conditional Prompt Synthesis for Zero-Shot Anomaly Detection
di: Chen, Qiyu, et al.
Pubblicazione: (2025)

LEMON: Learning 3D Human-Object Interaction Relation from 2D Images
di: Yang, Yuhang, et al.
Pubblicazione: (2023)

IAD-GPT: Advancing Visual Knowledge in Multimodal Large Language Model for Industrial Anomaly Detection
di: Li, Zewen, et al.
Pubblicazione: (2025)

Leverage Task Context for Object Affordance Ranking
di: Huang, Haojie, et al.
Pubblicazione: (2024)

AG-VAS: Anchor-Guided Zero-Shot Visual Anomaly Segmentation with Large Multimodal Models
di: Qu, Zhen, et al.
Pubblicazione: (2026)

VisualAD: Language-Free Zero-Shot Anomaly Detection via Vision Transformer
di: Hou, Yanning, et al.
Pubblicazione: (2026)

Dual-Image Enhanced CLIP for Zero-Shot Anomaly Detection
di: Zhang, Zhaoxiang, et al.
Pubblicazione: (2024)

Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models
di: Shekhawat, Ria, et al.
Pubblicazione: (2025)

SSVP: Synergistic Semantic-Visual Prompting for Industrial Zero-Shot Anomaly Detection
di: Fu, Chenhao, et al.
Pubblicazione: (2026)

Learning Multi-view Multi-class Anomaly Detection
di: Yu, Qianzi, et al.
Pubblicazione: (2025)

ZSG-IAD: A Multimodal Framework for Zero-Shot Grounded Industrial Anomaly Detection
di: Chen, Qiuhui, et al.
Pubblicazione: (2026)

Visual Context Window Extension: A New Perspective for Long Video Understanding
di: Wei, Hongchen, et al.
Pubblicazione: (2024)

AnomalyAgent: Training-Free Agentic Models for Zero-/Few-Shot Anomaly Detection
di: Zhang, Yi, et al.
Pubblicazione: (2026)

AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM
di: Ahn, Sunghyun, et al.
Pubblicazione: (2025)

Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages
di: Hu, Jinyi, et al.
Pubblicazione: (2023)

Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization
di: Deng, Hanqiu, et al.
Pubblicazione: (2023)

VarAD: Lightweight High-Resolution Image Anomaly Detection via Visual Autoregressive Modeling
di: Cao, Yunkang, et al.
Pubblicazione: (2024)

AnoRefiner: Anomaly-Aware Group-Wise Refinement for Zero-Shot Industrial Anomaly Detection
di: Huang, Dayou, et al.
Pubblicazione: (2025)

On the Problem of Consistent Anomalies in Zero-Shot Industrial Anomaly Detection
di: Le-Gia, Tai, et al.
Pubblicazione: (2025)

On the Problem of Consistent Anomalies in Zero-Shot Anomaly Detection
di: Le-Gia, Tai
Pubblicazione: (2025)

Seeing Is Believing? A Benchmark for Multimodal Large Language Models on Visual Illusions and Anomalies
di: Hou, Wenjin, et al.
Pubblicazione: (2026)

GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding
di: Shao, Yawen, et al.
Pubblicazione: (2024)

Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection
di: Li, Kaiqiang, et al.
Pubblicazione: (2026)

Zero-Shot Image Anomaly Detection Using Generative Foundation Models
di: Abdi, Lemar, et al.
Pubblicazione: (2025)

Expanding Zero-Shot Object Counting with Rich Prompts
di: Zhu, Huilin, et al.
Pubblicazione: (2025)

AnomalyPainter: Vision-Language-Diffusion Synergy for Zero-Shot Realistic and Diverse Industrial Anomaly Synthesis
di: Lai, Zhangyu, et al.
Pubblicazione: (2025)

AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection
di: Cao, Yunkang, et al.
Pubblicazione: (2024)

LongCaptioning: Unlocking the Power of Long Video Caption Generation in Large Multimodal Models
di: Wei, Hongchen, et al.
Pubblicazione: (2025)

VETime: Vision Enhanced Zero-Shot Time Series Anomaly Detection
di: Yang, Yingyuan, et al.
Pubblicazione: (2026)

Distributed Zero-Shot Learning for Visual Recognition
di: Chen, Zhi, et al.
Pubblicazione: (2025)

Anomaly-Aware Vision-Language Adapters for Zero-Shot Anomaly Detection
di: Aqeel, Muhammad, et al.
Pubblicazione: (2026)

Zero-Shot Scene Understanding with Multimodal Large Language Models for Automated Vehicles
di: Elhenawy, Mohammed, et al.
Pubblicazione: (2025)

No Need For Real Anomaly: MLLM Empowered Zero-Shot Video Anomaly Detection
di: Dai, Zunkai, et al.
Pubblicazione: (2026)