Saved in:
| Main Authors: | Lin, Shiwei, Wang, Chenxu, Ding, Xiaozhen, Wang, Yi, Du, Boyuan, Song, Lei, Wang, Chenggang, Liu, Huaping |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.05405 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning Event Completeness for Weakly Supervised Video Anomaly Detection
by: Wang, Yu, et al.
Published: (2025)
by: Wang, Yu, et al.
Published: (2025)
CogVLM: Visual Expert for Pretrained Language Models
by: Wang, Weihan, et al.
Published: (2023)
by: Wang, Weihan, et al.
Published: (2023)
MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects
by: Fan, Lei, et al.
Published: (2024)
by: Fan, Lei, et al.
Published: (2024)
Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight
by: Ding, Xi, et al.
Published: (2024)
by: Ding, Xi, et al.
Published: (2024)
ONER: Online Experience Replay for Incremental Anomaly Detection
by: Jin, Yizhou, et al.
Published: (2024)
by: Jin, Yizhou, et al.
Published: (2024)
VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
by: Xu, Runsen, et al.
Published: (2024)
by: Xu, Runsen, et al.
Published: (2024)
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools
by: Tan, Rongbin, et al.
Published: (2026)
by: Tan, Rongbin, et al.
Published: (2026)
LogicAD: Explainable Anomaly Detection via VLM-based Text Feature Extraction
by: Jin, Er, et al.
Published: (2025)
by: Jin, Er, et al.
Published: (2025)
CogVLM2: Visual Language Models for Image and Video Understanding
by: Hong, Wenyi, et al.
Published: (2024)
by: Hong, Wenyi, et al.
Published: (2024)
HOLa: Zero-Shot HOI Detection with Low-Rank Decomposed VLM Feature Adaptation
by: Lei, Qinqian, et al.
Published: (2025)
by: Lei, Qinqian, et al.
Published: (2025)
FAIR: Frequency-aware Image Restoration for Industrial Visual Anomaly Detection
by: Liu, Tongkun, et al.
Published: (2023)
by: Liu, Tongkun, et al.
Published: (2023)
A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection
by: Zhang, Jiangning, et al.
Published: (2024)
by: Zhang, Jiangning, et al.
Published: (2024)
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability
by: Zhou, Weijie, et al.
Published: (2025)
by: Zhou, Weijie, et al.
Published: (2025)
EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection
by: Lei, Qinqian, et al.
Published: (2024)
by: Lei, Qinqian, et al.
Published: (2024)
Hierarchically Decoupled Mixture-of-Experts for Robust Traffic Sign Recognition in Complex Driving Scenarios
by: Wang, Mingxiao, et al.
Published: (2026)
by: Wang, Mingxiao, et al.
Published: (2026)
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection
by: Gao, Bin-Bin, et al.
Published: (2025)
by: Gao, Bin-Bin, et al.
Published: (2025)
Unifying VLM-Guided Flow Matching and Spectral Anomaly Detection for Interpretable Veterinary Diagnosis
by: Wang, Pu, et al.
Published: (2026)
by: Wang, Pu, et al.
Published: (2026)
A Trustworthy Method for Multimodal Emotion Recognition
by: Xue, Junxiao, et al.
Published: (2025)
by: Xue, Junxiao, et al.
Published: (2025)
AdaFV: Rethinking of Visual-Language alignment for VLM acceleration
by: Han, Jiayi, et al.
Published: (2025)
by: Han, Jiayi, et al.
Published: (2025)
InfoSyncNet: Information Synchronization Temporal Convolutional Network for Visual Speech Recognition
by: Xue, Junxiao, et al.
Published: (2025)
by: Xue, Junxiao, et al.
Published: (2025)
FlowDreamer: A RGB-D World Model with Flow-based Motion Representations for Robot Manipulation
by: Guo, Jun, et al.
Published: (2025)
by: Guo, Jun, et al.
Published: (2025)
Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
by: Huang, Chaoqin, et al.
Published: (2024)
by: Huang, Chaoqin, et al.
Published: (2024)
AgentIAD: Agentic Industrial Anomaly Detection via Adaptive Memory Augmentation
by: Miao, Junwen, et al.
Published: (2025)
by: Miao, Junwen, et al.
Published: (2025)
A Comparative Study of Neural Surface Reconstruction for Scientific Visualization
by: Yao, Siyuan, et al.
Published: (2024)
by: Yao, Siyuan, et al.
Published: (2024)
Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection
by: Tong, Xuan, et al.
Published: (2025)
by: Tong, Xuan, et al.
Published: (2025)
A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation
by: Zhao, Yi, et al.
Published: (2026)
by: Zhao, Yi, et al.
Published: (2026)
Learning Monocular Depth from Focus with Event Focal Stack
by: Jiang, Chenxu, et al.
Published: (2024)
by: Jiang, Chenxu, et al.
Published: (2024)
The Solution for the ICCV 2023 1st Scientific Figure Captioning Challenge
by: Chao, Dian, et al.
Published: (2024)
by: Chao, Dian, et al.
Published: (2024)
ASBench: Image Anomalies Synthesis Benchmark for Anomaly Detection
by: Zhang, Qunyi, et al.
Published: (2025)
by: Zhang, Qunyi, et al.
Published: (2025)
AnomalyDiffusion: Few-Shot Anomaly Image Generation with Diffusion Model
by: Hu, Teng, et al.
Published: (2023)
by: Hu, Teng, et al.
Published: (2023)
A Survey of Multimodal Hallucination Evaluation and Detection
by: Chen, Zhiyuan, et al.
Published: (2025)
by: Chen, Zhiyuan, et al.
Published: (2025)
PiCo: Active Manifold Canonicalization for Robust Robotic Visual Anomaly Detection
by: Yan, Teng, et al.
Published: (2026)
by: Yan, Teng, et al.
Published: (2026)
FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models
by: Cai, Kaitong, et al.
Published: (2025)
by: Cai, Kaitong, et al.
Published: (2025)
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM
by: Wang, Han, et al.
Published: (2024)
by: Wang, Han, et al.
Published: (2024)
Visual Anomaly Detection for Reliable Robotic Implantation of Flexible Microelectrode Array
by: Chen, Yitong, et al.
Published: (2025)
by: Chen, Yitong, et al.
Published: (2025)
CRCL: Causal Representation Consistency Learning for Anomaly Detection in Surveillance Videos
by: Liu, Yang, et al.
Published: (2025)
by: Liu, Yang, et al.
Published: (2025)
Collision-Aware Object-Goal Visual Navigation via Two-Stage Deep Reinforcement Learning
by: Wang, Hongwu, et al.
Published: (2025)
by: Wang, Hongwu, et al.
Published: (2025)
Dual Conditioned Motion Diffusion for Pose-Based Video Anomaly Detection
by: Wang, Hongsong, et al.
Published: (2024)
by: Wang, Hongsong, et al.
Published: (2024)
AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents
by: Wang, Pan, et al.
Published: (2026)
by: Wang, Pan, et al.
Published: (2026)
Research on Anomaly Detection Methods Based on Diffusion Models
by: Chen, Yi
Published: (2025)
by: Chen, Yi
Published: (2025)
Similar Items
-
Learning Event Completeness for Weakly Supervised Video Anomaly Detection
by: Wang, Yu, et al.
Published: (2025) -
CogVLM: Visual Expert for Pretrained Language Models
by: Wang, Weihan, et al.
Published: (2023) -
MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects
by: Fan, Lei, et al.
Published: (2024) -
Quo Vadis, Anomaly Detection? LLMs and VLMs in the Spotlight
by: Ding, Xi, et al.
Published: (2024) -
ONER: Online Experience Replay for Incremental Anomaly Detection
by: Jin, Yizhou, et al.
Published: (2024)