Saved in:
| Main Author: | Fassold, Hannes |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.09202 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Porting Large Language Models to Mobile Devices for Question Answering
by: Fassold, Hannes
Published: (2024)
by: Fassold, Hannes
Published: (2024)
Real time anomalies detection on video
by: Poirier, Fabien
Published: (2024)
by: Poirier, Fabien
Published: (2024)
BronchoLumen: Analysis of recent YOLO-based architectures for real-time bronchial orifice detection in video bronchoscopy
by: Li, Yongchao, et al.
Published: (2026)
by: Li, Yongchao, et al.
Published: (2026)
Taming generative video models for zero-shot optical flow extraction
by: Kim, Seungwoo, et al.
Published: (2025)
by: Kim, Seungwoo, et al.
Published: (2025)
Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs
by: Chang, Qiong, et al.
Published: (2025)
by: Chang, Qiong, et al.
Published: (2025)
HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution
by: Li, Hua, et al.
Published: (2024)
by: Li, Hua, et al.
Published: (2024)
PREGO: online mistake detection in PRocedural EGOcentric videos
by: Flaborea, Alessandro, et al.
Published: (2024)
by: Flaborea, Alessandro, et al.
Published: (2024)
Few-shot target-driven instance detection based on open-vocabulary object detection models
by: Crulis, Ben, et al.
Published: (2024)
by: Crulis, Ben, et al.
Published: (2024)
Network transferability of adversarial patches in real-time object detection
by: Bayer, Jens, et al.
Published: (2024)
by: Bayer, Jens, et al.
Published: (2024)
VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention
by: Zheng, Mingzhe, et al.
Published: (2025)
by: Zheng, Mingzhe, et al.
Published: (2025)
FROSS: Faster-than-Real-Time Online 3D Semantic Scene Graph Generation from RGB-D Images
by: Hou, Hao-Yu, et al.
Published: (2025)
by: Hou, Hao-Yu, et al.
Published: (2025)
Pushing the boundaries of event subsampling in event-based video classification using CNNs
by: Araghi, Hesam, et al.
Published: (2024)
by: Araghi, Hesam, et al.
Published: (2024)
DepthFake: a depth-based strategy for detecting Deepfake videos
by: Maiano, Luca, et al.
Published: (2022)
by: Maiano, Luca, et al.
Published: (2022)
Novel adaptation of video segmentation to 3D MRI: efficient zero-shot knee segmentation with SAM2
by: Yu, Andrew Seohwan, et al.
Published: (2024)
by: Yu, Andrew Seohwan, et al.
Published: (2024)
OmViD: Omni-supervised active learning for video action detection
by: Rana, Aayush, et al.
Published: (2025)
by: Rana, Aayush, et al.
Published: (2025)
Exploiting temporal information to detect conversational groups in videos and predict the next speaker
by: Tosato, Lucrezia, et al.
Published: (2024)
by: Tosato, Lucrezia, et al.
Published: (2024)
Balancing long- and short-term dynamics for the modeling of saliency in videos
by: Wulff, Theodor, et al.
Published: (2025)
by: Wulff, Theodor, et al.
Published: (2025)
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification
by: Yao, Jingfeng, et al.
Published: (2024)
by: Yao, Jingfeng, et al.
Published: (2024)
Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection
by: Delić, Anja, et al.
Published: (2025)
by: Delić, Anja, et al.
Published: (2025)
Towards multi-modal forgery representation learning for AI-generated video detection and localization
by: Le, Dat, et al.
Published: (2026)
by: Le, Dat, et al.
Published: (2026)
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video
by: Liao, Guiqiu, et al.
Published: (2024)
by: Liao, Guiqiu, et al.
Published: (2024)
A multi-center analysis of deep learning methods for video polyp detection and segmentation
by: Ghatwary, Noha, et al.
Published: (2026)
by: Ghatwary, Noha, et al.
Published: (2026)
Multimodal video analysis for crowd anomaly detection using open access tourism cameras
by: Dionis-Ros, Alejandro, et al.
Published: (2024)
by: Dionis-Ros, Alejandro, et al.
Published: (2024)
CLIP-driven Outliers Synthesis for few-shot OOD detection
by: Sun, Hao, et al.
Published: (2024)
by: Sun, Hao, et al.
Published: (2024)
Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video
by: Feng, Runyang, et al.
Published: (2025)
by: Feng, Runyang, et al.
Published: (2025)
YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems
by: Wang, Chien-Yao, et al.
Published: (2024)
by: Wang, Chien-Yao, et al.
Published: (2024)
Underwater object detection in sonar imagery with detection transformer and Zero-shot neural architecture search
by: Gu, XiaoTong, et al.
Published: (2025)
by: Gu, XiaoTong, et al.
Published: (2025)
A bag of tricks for real-time Mitotic Figure detection
by: Marzahl, Christian, et al.
Published: (2025)
by: Marzahl, Christian, et al.
Published: (2025)
Learn Faster and Remember More: Balancing Exploration and Exploitation for Continual Test-time Adaptation
by: Yang, Pinci, et al.
Published: (2025)
by: Yang, Pinci, et al.
Published: (2025)
Bidirectional skip-frame prediction for video anomaly detection with intra-domain disparity-driven attention
by: Lyu, Jiahao, et al.
Published: (2024)
by: Lyu, Jiahao, et al.
Published: (2024)
Faster Diffusion Action Segmentation
by: Wang, Shuaibing, et al.
Published: (2024)
by: Wang, Shuaibing, et al.
Published: (2024)
Anomaly detection in non-stationary videos using time-recursive differencing network based prediction
by: Pillai, Gargi V., et al.
Published: (2025)
by: Pillai, Gargi V., et al.
Published: (2025)
VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention
by: Zheng, Mingzhe, et al.
Published: (2024)
by: Zheng, Mingzhe, et al.
Published: (2024)
Study of detecting behavioral signatures within DeepFake videos
by: Miao, Qiaomu, et al.
Published: (2022)
by: Miao, Qiaomu, et al.
Published: (2022)
Training-free zero-shot 3D symmetry detection with visual features back-projected to geometry
by: Aguirre, Isaac, et al.
Published: (2025)
by: Aguirre, Isaac, et al.
Published: (2025)
Colony Grounded SAM2: Zero-shot detection and segmentation of bacterial colonies using foundation models
by: Korporaal, Daan, et al.
Published: (2026)
by: Korporaal, Daan, et al.
Published: (2026)
SurgiSAM2: Fine-tuning a foundational model for surgical video anatomy segmentation and detection
by: Kamtam, Devanish N., et al.
Published: (2025)
by: Kamtam, Devanish N., et al.
Published: (2025)
A lightweight detector for real-time detection of remote sensing images
by: Wang, Qianyi, et al.
Published: (2025)
by: Wang, Qianyi, et al.
Published: (2025)
Latent Denoising Diffusion GAN: Faster sampling, Higher image quality
by: Trinh, Luan Thanh, et al.
Published: (2024)
by: Trinh, Luan Thanh, et al.
Published: (2024)
Prompt as Knowledge Bank: Boost Vision-language model via Structural Representation for zero-shot medical detection
by: Yang, Yuguang, et al.
Published: (2025)
by: Yang, Yuguang, et al.
Published: (2025)
Similar Items
-
Porting Large Language Models to Mobile Devices for Question Answering
by: Fassold, Hannes
Published: (2024) -
Real time anomalies detection on video
by: Poirier, Fabien
Published: (2024) -
BronchoLumen: Analysis of recent YOLO-based architectures for real-time bronchial orifice detection in video bronchoscopy
by: Li, Yongchao, et al.
Published: (2026) -
Taming generative video models for zero-shot optical flow extraction
by: Kim, Seungwoo, et al.
Published: (2025) -
Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs
by: Chang, Qiong, et al.
Published: (2025)