:: Library Catalog

Bibliographic Details
Main Authors:	Lin, Shiwei, Wang, Chenxu, Ding, Xiaozhen, Wang, Yi, Du, Boyuan, Song, Lei, Wang, Chenggang, Liu, Huaping
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2506.05405

Similar Items

Learning Event Completeness for Weakly Supervised Video Anomaly Detection
by: Wang, Yu, et al.
Published: (2025)

CogVLM: Visual Expert for Pretrained Language Models
by: Wang, Weihan, et al.
Published: (2023)

MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects
by: Fan, Lei, et al.
Published: (2024)

Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight
by: Ding, Xi, et al.
Published: (2024)

ONER: Online Experience Replay for Incremental Anomaly Detection
by: Jin, Yizhou, et al.
Published: (2024)

VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
by: Xu, Runsen, et al.
Published: (2024)

IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools
by: Tan, Rongbin, et al.
Published: (2026)

LogicAD: Explainable Anomaly Detection via VLM-based Text Feature Extraction
by: Jin, Er, et al.
Published: (2025)

CogVLM2: Visual Language Models for Image and Video Understanding
by: Hong, Wenyi, et al.
Published: (2024)

HOLa: Zero-Shot HOI Detection with Low-Rank Decomposed VLM Feature Adaptation
by: Lei, Qinqian, et al.
Published: (2025)

FAIR: Frequency-aware Image Restoration for Industrial Visual Anomaly Detection
by: Liu, Tongkun, et al.
Published: (2023)

A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection
by: Zhang, Jiangning, et al.
Published: (2024)

PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability
by: Zhou, Weijie, et al.
Published: (2025)

EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
by: Lei, Qinqian, et al.
Published: (2024)

Hierarchically Decoupled Mixture-of-Experts for Robust Traffic Sign Recognition in Complex Driving Scenarios
by: Wang, Mingxiao, et al.
Published: (2026)

AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection
by: Gao, Bin-Bin, et al.
Published: (2025)

Unifying VLM-Guided Flow Matching and Spectral Anomaly Detection for Interpretable Veterinary Diagnosis
by: Wang, Pu, et al.
Published: (2026)

A Trustworthy Method for Multimodal Emotion Recognition
by: Xue, Junxiao, et al.
Published: (2025)

AdaFV: Rethinking of Visual-Language alignment for VLM acceleration
by: Han, Jiayi, et al.
Published: (2025)

InfoSyncNet: Information Synchronization Temporal Convolutional Network for Visual Speech Recognition
by: Xue, Junxiao, et al.
Published: (2025)

FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
by: Guo, Jun, et al.
Published: (2025)

Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
by: Huang, Chaoqin, et al.
Published: (2024)

AgentIAD: Agentic Industrial Anomaly Detection via Adaptive Memory Augmentation
by: Miao, Junwen, et al.
Published: (2025)

A Comparative Study of Neural Surface Reconstruction for Scientific Visualization
by: Yao, Siyuan, et al.
Published: (2024)

Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection
by: Tong, Xuan, et al.
Published: (2025)

A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation
by: Zhao, Yi, et al.
Published: (2026)

Learning Monocular Depth from Focus with Event Focal Stack
by: Jiang, Chenxu, et al.
Published: (2024)

The Solution for the ICCV 2023 1st Scientific Figure Captioning Challenge
by: Chao, Dian, et al.
Published: (2024)

ASBench: Image Anomalies Synthesis Benchmark for Anomaly Detection
by: Zhang, Qunyi, et al.
Published: (2025)

AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model
by: Hu, Teng, et al.
Published: (2023)

A Survey of Multimodal Hallucination Evaluation and Detection
by: Chen, Zhiyuan, et al.
Published: (2025)

PiCo: Active Manifold Canonicalization for Robust Robotic Visual Anomaly Detection
by: Yan, Teng, et al.
Published: (2026)

FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
by: Cai, Kaitong, et al.
Published: (2025)

Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM
by: Wang, Han, et al.
Published: (2024)

Visual Anomaly Detection for Reliable Robotic Implantation of Flexible Microelectrode Array
by: Chen, Yitong, et al.
Published: (2025)

CRCL: Causal Representation Consistency Learning for Anomaly Detection in Surveillance Videos
by: Liu, Yang, et al.
Published: (2025)

Collision-Aware Object-Goal Visual Navigation via Two-Stage Deep Reinforcement Learning
by: Wang, Hongwu, et al.
Published: (2025)

Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection
by: Wang, Hongsong, et al.
Published: (2024)

AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents
by: Wang, Pan, et al.
Published: (2026)

Research on Anomaly Detection Methods Based on Diffusion Models
by: Chen, Yi
Published: (2025)