Saved in:
| Main Authors: | Bekit, Lokman, Karim, Hamza, Nguyen, Nghia T, Yilmaz, Yasin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.03040 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ComplexVAD: Detecting Interaction Anomalies in Video
by: Mumcu, Furkan, et al.
Published: (2025)
by: Mumcu, Furkan, et al.
Published: (2025)
Is Video Anomaly Detection Misframed? Evidence from LLM-Based and Multi-Scene Models
by: Mumcu, Furkan, et al.
Published: (2026)
by: Mumcu, Furkan, et al.
Published: (2026)
Leveraging Multimodal LLM Descriptions of Activity for Explainable Semi-Supervised Video Anomaly Detection
by: Mumcu, Furkan, et al.
Published: (2025)
by: Mumcu, Furkan, et al.
Published: (2025)
Geometry-Aware Semantic Reasoning for Training Free Video Anomaly Detection
by: Zia, Ali, et al.
Published: (2026)
by: Zia, Ali, et al.
Published: (2026)
LLM-Guided Agentic Object Detection for Open-World Understanding
by: Mumcu, Furkan, et al.
Published: (2025)
by: Mumcu, Furkan, et al.
Published: (2025)
AnomalyAgent: Training-Free Agentic Models for Zero-/Few-Shot Anomaly Detection
by: Zhang, Yi, et al.
Published: (2026)
by: Zhang, Yi, et al.
Published: (2026)
Universal and Efficient Detection of Adversarial Data through Nonuniform Impact on Network Layers
by: Mumcu, Furkan, et al.
Published: (2025)
by: Mumcu, Furkan, et al.
Published: (2025)
Agentic AI-Empowered Dynamic Survey Framework
by: Mumcu, Furkan, et al.
Published: (2026)
by: Mumcu, Furkan, et al.
Published: (2026)
EventVAD: Training-Free Event-Aware Video Anomaly Detection
by: Shao, Yihua, et al.
Published: (2025)
by: Shao, Yihua, et al.
Published: (2025)
CoReVAD: A Contextual Reasoning Framework for Training-Free Video Anomaly Detection
by: Lim, Hyeongmuk, et al.
Published: (2026)
by: Lim, Hyeongmuk, et al.
Published: (2026)
From Frames to Events: Rethinking Evaluation in Human-Centric Video Anomaly Detection
by: Rashvand, Narges, et al.
Published: (2026)
by: Rashvand, Narges, et al.
Published: (2026)
SphereVAD: Training-Free Video Anomaly Detection via Geodesic Inference on the Unit Hypersphere
by: Huang, Chao, et al.
Published: (2026)
by: Huang, Chao, et al.
Published: (2026)
VADTree: Explainable Training-Free Video Anomaly Detection via Hierarchical Granularity-Aware Tree
by: Li, Wenlong, et al.
Published: (2025)
by: Li, Wenlong, et al.
Published: (2025)
VisionGuard: Synergistic Framework for Helmet Violation Detection
by: Nguyen, Lam-Huy, et al.
Published: (2025)
by: Nguyen, Lam-Huy, et al.
Published: (2025)
Multimodal Attack Detection for Action Recognition Models
by: Mumcu, Furkan, et al.
Published: (2024)
by: Mumcu, Furkan, et al.
Published: (2024)
PANDA: Towards Generalist Video Anomaly Detection via Agentic AI Engineer
by: Yang, Zhiwei, et al.
Published: (2025)
by: Yang, Zhiwei, et al.
Published: (2025)
ViConsFormer: Constituting Meaningful Phrases of Scene Texts using Transformer-based Method in Vietnamese Text-based Visual Question Answering
by: Nguyen, Nghia Hieu, et al.
Published: (2024)
by: Nguyen, Nghia Hieu, et al.
Published: (2024)
Agentic Keyframe Search for Video Question Answering
by: Fan, Sunqi, et al.
Published: (2025)
by: Fan, Sunqi, et al.
Published: (2025)
ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation
by: Xuan, Xiwei, et al.
Published: (2025)
by: Xuan, Xiwei, et al.
Published: (2025)
Hypergraph-Enhanced Training-Free and Language-Free Few-Shot Anomaly Detection
by: Xie, Guohuan, et al.
Published: (2026)
by: Xie, Guohuan, et al.
Published: (2026)
An Efficient Streaming Video Understanding Framework with Agentic Control
by: Liu, Jinming, et al.
Published: (2026)
by: Liu, Jinming, et al.
Published: (2026)
EM-Vid: Training-Free Entity-Centric Memory for Efficient and Consistent Multi-Shot Video Generation
by: Vandersanden, Jente, et al.
Published: (2026)
by: Vandersanden, Jente, et al.
Published: (2026)
A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparoscopic Video
by: Fehrentz, Maximilian, et al.
Published: (2026)
by: Fehrentz, Maximilian, et al.
Published: (2026)
LAVID: An Agentic LVLM Framework for Diffusion-Generated Video Detection
by: Liu, Qingyuan, et al.
Published: (2025)
by: Liu, Qingyuan, et al.
Published: (2025)
Harnessing Large Language Models for Training-free Video Anomaly Detection
by: Zanella, Luca, et al.
Published: (2024)
by: Zanella, Luca, et al.
Published: (2024)
Bridging the Training-Deployment Gap: Gated Encoding and Multi-Scale Refinement for Efficient Quantization-Aware Image Enhancement
by: To-Thanh, Dat, et al.
Published: (2026)
by: To-Thanh, Dat, et al.
Published: (2026)
Human-Centric Video Anomaly Detection Through Spatio-Temporal Pose Tokenization and Transformer
by: Noghre, Ghazal Alinezhad, et al.
Published: (2024)
by: Noghre, Ghazal Alinezhad, et al.
Published: (2024)
DocVideoQA: Towards Comprehensive Understanding of Document-Centric Videos through Question Answering
by: Wang, Haochen, et al.
Published: (2025)
by: Wang, Haochen, et al.
Published: (2025)
ViOCRVQA: Novel Benchmark Dataset and Vision Reader for Visual Question Answering by Understanding Vietnamese Text in Images
by: Pham, Huy Quang, et al.
Published: (2024)
by: Pham, Huy Quang, et al.
Published: (2024)
HeadHunt-VAD: Hunting Robust Anomaly-Sensitive Heads in MLLM for Tuning-Free Video Anomaly Detection
by: Cai, Zhaolin, et al.
Published: (2025)
by: Cai, Zhaolin, et al.
Published: (2025)
Is Training Necessary for Anomaly Detection?
by: Zhang, Xingwu, et al.
Published: (2026)
by: Zhang, Xingwu, et al.
Published: (2026)
Object-Centric Framework for Video Moment Retrieval
by: Li, Zongyao, et al.
Published: (2025)
by: Li, Zongyao, et al.
Published: (2025)
The Evolution of Video Anomaly Detection: A Unified Framework from DNN to MLLM
by: Gao, Shibo, et al.
Published: (2025)
by: Gao, Shibo, et al.
Published: (2025)
An Exploratory Study on Human-Centric Video Anomaly Detection through Variational Autoencoders and Trajectory Prediction
by: Noghre, Ghazal Alinezhad, et al.
Published: (2024)
by: Noghre, Ghazal Alinezhad, et al.
Published: (2024)
Language-driven Grasp Detection
by: Vuong, An Dinh, et al.
Published: (2024)
by: Vuong, An Dinh, et al.
Published: (2024)
Mitigating Visual Context Degradation in Large Multimodal Models: A Training-Free Decoupled Agentic Framework
by: Jia, Hongrui, et al.
Published: (2025)
by: Jia, Hongrui, et al.
Published: (2025)
GenKOL: Modular Generative AI Framework For Scalable Virtual KOL Generation
by: To, Tan-Hiep, et al.
Published: (2025)
by: To, Tan-Hiep, et al.
Published: (2025)
Frequency-Guided Diffusion Model with Perturbation Training for Skeleton-Based Video Anomaly Detection
by: Tan, Xiaofeng, et al.
Published: (2024)
by: Tan, Xiaofeng, et al.
Published: (2024)
Video Anomaly Detection with Contours -- A Study
by: Siemon, Mia, et al.
Published: (2025)
by: Siemon, Mia, et al.
Published: (2025)
VADMamba++: Efficient Video Anomaly Detection via Hybrid Modeling in Grayscale Space
by: Lyu, Jihao, et al.
Published: (2026)
by: Lyu, Jihao, et al.
Published: (2026)
Similar Items
-
ComplexVAD: Detecting Interaction Anomalies in Video
by: Mumcu, Furkan, et al.
Published: (2025) -
Is Video Anomaly Detection Misframed? Evidence from LLM-Based and Multi-Scene Models
by: Mumcu, Furkan, et al.
Published: (2026) -
Leveraging Multimodal LLM Descriptions of Activity for Explainable Semi-Supervised Video Anomaly Detection
by: Mumcu, Furkan, et al.
Published: (2025) -
Geometry-Aware Semantic Reasoning for Training Free Video Anomaly Detection
by: Zia, Ali, et al.
Published: (2026) -
LLM-Guided Agentic Object Detection for Open-World Understanding
by: Mumcu, Furkan, et al.
Published: (2025)