:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Bekit, Lokman, Karim, Hamza, Nguyen, Nghia T, Yilmaz, Yasin
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.03040
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ComplexVAD: Detecting Interaction Anomalies in Video
by: Mumcu, Furkan, et al.
Published: (2025)

Is Video Anomaly Detection Misframed? Evidence from LLM-Based and Multi-Scene Models
by: Mumcu, Furkan, et al.
Published: (2026)

Leveraging Multimodal LLM Descriptions of Activity for Explainable Semi-Supervised Video Anomaly Detection
by: Mumcu, Furkan, et al.
Published: (2025)

Geometry-Aware Semantic Reasoning for Training Free Video Anomaly Detection
by: Zia, Ali, et al.
Published: (2026)

LLM-Guided Agentic Object Detection for Open-World Understanding
by: Mumcu, Furkan, et al.
Published: (2025)

AnomalyAgent: Training-Free Agentic Models for Zero-/Few-Shot Anomaly Detection
by: Zhang, Yi, et al.
Published: (2026)

Universal and Efficient Detection of Adversarial Data through Nonuniform Impact on Network Layers
by: Mumcu, Furkan, et al.
Published: (2025)

Agentic AI-Empowered Dynamic Survey Framework
by: Mumcu, Furkan, et al.
Published: (2026)

EventVAD: Training-Free Event-Aware Video Anomaly Detection
by: Shao, Yihua, et al.
Published: (2025)

CoReVAD: A Contextual Reasoning Framework for Training-Free Video Anomaly Detection
by: Lim, Hyeongmuk, et al.
Published: (2026)

From Frames to Events: Rethinking Evaluation in Human-Centric Video Anomaly Detection
by: Rashvand, Narges, et al.
Published: (2026)

SphereVAD: Training-Free Video Anomaly Detection via Geodesic Inference on the Unit Hypersphere
by: Huang, Chao, et al.
Published: (2026)

VADTree: Explainable Training-Free Video Anomaly Detection via Hierarchical Granularity-Aware Tree
by: Li, Wenlong, et al.
Published: (2025)

VisionGuard: Synergistic Framework for Helmet Violation Detection
by: Nguyen, Lam-Huy, et al.
Published: (2025)

Multimodal Attack Detection for Action Recognition Models
by: Mumcu, Furkan, et al.
Published: (2024)

PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer
by: Yang, Zhiwei, et al.
Published: (2025)

ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
by: Nguyen, Nghia Hieu, et al.
Published: (2024)

Agentic Keyframe Search for Video Question Answering
by: Fan, Sunqi, et al.
Published: (2025)

ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation
by: Xuan, Xiwei, et al.
Published: (2025)

Hypergraph-Enhanced Training-Free and Language-Free Few-Shot Anomaly Detection
by: Xie, Guohuan, et al.
Published: (2026)

An Efficient Streaming Video Understanding Framework with Agentic Control
by: Liu, Jinming, et al.
Published: (2026)

EM-Vid: Training-Free Entity-Centric Memory for Efficient and Consistent Multi-Shot Video Generation
by: Vandersanden, Jente, et al.
Published: (2026)

A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparoscopic Video
by: Fehrentz, Maximilian, et al.
Published: (2026)

LAVID: An Agentic LVLM Framework for Diffusion-Generated Video Detection
by: Liu, Qingyuan, et al.
Published: (2025)

Harnessing Large Language Models for Training-free Video Anomaly Detection
by: Zanella, Luca, et al.
Published: (2024)

Bridging the Training-Deployment Gap: Gated Encoding and Multi-Scale Refinement for Efficient Quantization-Aware Image Enhancement
by: To-Thanh, Dat, et al.
Published: (2026)

Human-Centric Video Anomaly Detection Through Spatio-Temporal Pose Tokenization and Transformer
by: Noghre, Ghazal Alinezhad, et al.
Published: (2024)

DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering
by: Wang, Haochen, et al.
Published: (2025)

ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images
by: Pham, Huy Quang, et al.
Published: (2024)

HeadHunt-VAD: Hunting Robust Anomaly-Sensitive Heads in MLLM for Tuning-Free Video Anomaly Detection
by: Cai, Zhaolin, et al.
Published: (2025)

Is Training Necessary for Anomaly Detection?
by: Zhang, Xingwu, et al.
Published: (2026)

Object-Centric Framework for Video Moment Retrieval
by: Li, Zongyao, et al.
Published: (2025)

The Evolution of Video Anomaly Detection: A Unified Framework from DNN to MLLM
by: Gao, Shibo, et al.
Published: (2025)

An Exploratory Study on Human-Centric Video Anomaly Detection through Variational Autoencoders and Trajectory Prediction
by: Noghre, Ghazal Alinezhad, et al.
Published: (2024)

Language-driven Grasp Detection
by: Vuong, An Dinh, et al.
Published: (2024)

Mitigating Visual Context Degradation in Large Multimodal Models: A Training-Free Decoupled Agentic Framework
by: Jia, Hongrui, et al.
Published: (2025)

GenKOL: Modular Generative AI Framework For Scalable Virtual KOL Generation
by: To, Tan-Hiep, et al.
Published: (2025)

Frequency-Guided Diffusion Model with Perturbation Training for Skeleton-Based Video Anomaly Detection
by: Tan, Xiaofeng, et al.
Published: (2024)

Video Anomaly Detection with Contours -- A Study
by: Siemon, Mia, et al.
Published: (2025)

VADMamba++: Efficient Video Anomaly Detection via Hybrid Modeling in Grayscale Space
by: Lyu, Jihao, et al.
Published: (2026)