Library Catalog

Bibliographic Details
Main Author:	Fassold, Hannes
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2502.09202

Similar Items

Porting Large Language Models to Mobile Devices for Question Answering
by: Fassold, Hannes
Published: (2024)

Real time anomalies detection on video
by: Poirier, Fabien
Published: (2024)

BronchoLumen: Analysis of recent YOLO-based architectures for real-time bronchial orifice detection in video bronchoscopy
by: Li, Yongchao, et al.
Published: (2026)

Taming generative video models for zero-shot optical flow extraction
by: Kim, Seungwoo, et al.
Published: (2025)

Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs
by: Chang, Qiong, et al.
Published: (2025)

HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution
by: Li, Hua, et al.
Published: (2024)

PREGO: online mistake detection in PRocedural EGOcentric videos
by: Flaborea, Alessandro, et al.
Published: (2024)

Few-shot target-driven instance detection based on open-vocabulary object detection models
by: Crulis, Ben, et al.
Published: (2024)

Network transferability of adversarial patches in real-time object detection
by: Bayer, Jens, et al.
Published: (2024)

VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention
by: Zheng, Mingzhe, et al.
Published: (2025)

FROSS: Faster-than-Real-Time Online 3D Semantic Scene Graph Generation from RGB-D Images
by: Hou, Hao-Yu, et al.
Published: (2025)

Pushing the boundaries of event subsampling in event-based video classification using CNNs
by: Araghi, Hesam, et al.
Published: (2024)

DepthFake: a depth-based strategy for detecting Deepfake videos
by: Maiano, Luca, et al.
Published: (2022)

Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2
by: Yu, Andrew Seohwan, et al.
Published: (2024)

OmViD: Omni-supervised active learning for video action detection
by: Rana, Aayush, et al.
Published: (2025)

Exploiting temporal information to detect conversational groups in videos and predict the next speaker
by: Tosato, Lucrezia, et al.
Published: (2024)

Balancing long- and short-term dynamics for the modeling of saliency in videos
by: Wulff, Theodor, et al.
Published: (2025)

FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification
by: Yao, Jingfeng, et al.
Published: (2024)

Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection
by: Delić, Anja, et al.
Published: (2025)

Towards multi-modal forgery representation learning for AI-generated video detection and localization
by: Le, Dat, et al.
Published: (2026)

Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video
by: Liao, Guiqiu, et al.
Published: (2024)

A multi-center analysis of deep learning methods for video polyp detection and segmentation
by: Ghatwary, Noha, et al.
Published: (2026)

Multimodal video analysis for crowd anomaly detection using open access tourism cameras
by: Dionis-Ros, Alejandro, et al.
Published: (2024)

CLIP-driven Outliers Synthesis for few-shot OOD detection
by: Sun, Hao, et al.
Published: (2024)

Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video
by: Feng, Runyang, et al.
Published: (2025)

YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems
by: Wang, Chien-Yao, et al.
Published: (2024)

Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search
by: Gu, XiaoTong, et al.
Published: (2025)

A bag of tricks for real-time Mitotic Figure detection
by: Marzahl, Christian, et al.
Published: (2025)

Learn Faster and Remember More: Balancing Exploration and Exploitation for Continual Test-time Adaptation
by: Yang, Pinci, et al.
Published: (2025)

Bidirectional skip-frame prediction for video anomaly detection with intra-domain disparity-driven attention
by: Lyu, Jiahao, et al.
Published: (2024)

Faster Diffusion Action Segmentation
by: Wang, Shuaibing, et al.
Published: (2024)

Anomaly detection in non-stationary videos using time-recursive differencing network based prediction
by: Pillai, Gargi V., et al.
Published: (2025)

VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention
by: Zheng, Mingzhe, et al.
Published: (2024)

Study of detecting behavioral signatures within DeepFake videos
by: Miao, Qiaomu, et al.
Published: (2022)

Training-free zero-shot 3D symmetry detection with visual features back-projected to geometry
by: Aguirre, Isaac, et al.
Published: (2025)

Colony Grounded SAM2: Zero-shot detection and segmentation of bacterial colonies using foundation models
by: Korporaal, Daan, et al.
Published: (2026)

SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection
by: Kamtam, Devanish N., et al.
Published: (2025)

A lightweight detector for real-time detection of remote sensing images
by: Wang, Qianyi, et al.
Published: (2025)

Latent Denoising Diffusion GAN: Faster sampling, Higher image quality
by: Trinh, Luan Thanh, et al.
Published: (2024)

Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection
by: Yang, Yuguang, et al.
Published: (2025)