Saved in:
Bibliographic Details
Main Author: Fassold, Hannes
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.09202
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909491179552768
author Fassold, Hannes
author_facet Fassold, Hannes
contents The detection of shot boundaries (hardcuts and short dissolves), sampling structure (progressive / interlaced / pulldown) and dynamic keyframes in a video are fundamental video analysis tasks which have to be done before any further high-level analysis tasks. We present a novel algorithm which does all these analysis tasks in an unified way, by utilizing a combination of inter-frame and intra-frame measures derived from the motion field and normalized cross correlation. The algorithm runs four times faster than real-time due to sparse and selective calculation of these measures. An initial evaluation furthermore shows that the proposed algorithm is extremely robust even for challenging content showing large camera or object motion, flashlights, flicker or low contrast / noise.
format Preprint
id arxiv_https___arxiv_org_abs_2502_09202
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Faster than real-time detection of shot boundaries, sampling structure and dynamic keyframes in video
Fassold, Hannes
Computer Vision and Pattern Recognition
The detection of shot boundaries (hardcuts and short dissolves), sampling structure (progressive / interlaced / pulldown) and dynamic keyframes in a video are fundamental video analysis tasks which have to be done before any further high-level analysis tasks. We present a novel algorithm which does all these analysis tasks in an unified way, by utilizing a combination of inter-frame and intra-frame measures derived from the motion field and normalized cross correlation. The algorithm runs four times faster than real-time due to sparse and selective calculation of these measures. An initial evaluation furthermore shows that the proposed algorithm is extremely robust even for challenging content showing large camera or object motion, flashlights, flicker or low contrast / noise.
title Faster than real-time detection of shot boundaries, sampling structure and dynamic keyframes in video
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2502.09202