Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kalkhorani, Vahid Ahmadi, Zhang, Qingquan, Song, Guanqun, Zhu, Ting
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2401.10254
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916097596325888
author	Kalkhorani, Vahid Ahmadi Zhang, Qingquan Song, Guanqun Zhu, Ting
author_facet	Kalkhorani, Vahid Ahmadi Zhang, Qingquan Song, Guanqun Zhu, Ting
contents	Video smmarization is a crucial method to reduce the time of videos which reduces the spent time to watch/review a long video. This apporach has became more important as the amount of publisehed video is increasing everyday. A single or multiple videos can be summarized into a relatively short video using various of techniques from multimodal audio-visual techniques, to natural language processing approaches. Audiovisual techniques may be used to recognize significant visual events and pick the most important parts, while NLP techniques can be used to evaluate the audio transcript and extract the main sentences (timestamps) and corresponding video frames from the original video. Another approach is to use the best of both domain. Meaning that we can use audio-visual cues as well as video transcript to extract and summarize the video. In this paper, we combine a variety of NLP techniques (extractive and contect-based summarizers) with video processing techniques to convert a long video into a single relatively short video. We design this toll in a way that user can specify the relative length of the summarized video. We have also explored ways of summarizing and concatenating multiple videos into a single short video which will help having most important concepts from the same subject in a single short video. Out approach shows that video summarizing is a difficult but significant work, with substantial potential for further research and development, and it is possible thanks to the development of NLP models.
format	Preprint
id	arxiv_https___arxiv_org_abs_2401_10254
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Beyond the Frame: Single and mutilple video summarization method with user-defined length Kalkhorani, Vahid Ahmadi Zhang, Qingquan Song, Guanqun Zhu, Ting Computer Vision and Pattern Recognition Machine Learning Video smmarization is a crucial method to reduce the time of videos which reduces the spent time to watch/review a long video. This apporach has became more important as the amount of publisehed video is increasing everyday. A single or multiple videos can be summarized into a relatively short video using various of techniques from multimodal audio-visual techniques, to natural language processing approaches. Audiovisual techniques may be used to recognize significant visual events and pick the most important parts, while NLP techniques can be used to evaluate the audio transcript and extract the main sentences (timestamps) and corresponding video frames from the original video. Another approach is to use the best of both domain. Meaning that we can use audio-visual cues as well as video transcript to extract and summarize the video. In this paper, we combine a variety of NLP techniques (extractive and contect-based summarizers) with video processing techniques to convert a long video into a single relatively short video. We design this toll in a way that user can specify the relative length of the summarized video. We have also explored ways of summarizing and concatenating multiple videos into a single short video which will help having most important concepts from the same subject in a single short video. Out approach shows that video summarizing is a difficult but significant work, with substantial potential for further research and development, and it is possible thanks to the development of NLP models.
title	Beyond the Frame: Single and mutilple video summarization method with user-defined length
topic	Computer Vision and Pattern Recognition Machine Learning
url	https://arxiv.org/abs/2401.10254

Similar Items