Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Nurye, Desale Fentaw, García, Jorge, Aktas, Yagmur
Format:	Recurso digital
Language:
Published:	Zenodo 2023
Online Access:	https://doi.org/10.1117/12.2680257
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866901123931045888
author	Nurye, Desale Fentaw García, Jorge Aktas, Yagmur
author_facet	Nurye, Desale Fentaw García, Jorge Aktas, Yagmur
contents	<p>Visual indexing, or the ability to search and analyze visual media such as images and videos, is important for law enforcement agencies because it can speed up criminal investigations. As more and more visual media is created and shared online, the ability to effectively search and analyze this data becomes increasingly important for<br>investigators to do their job effectively. The major challenges for video captioning include accurately recognizing the objects and activities in the image, understanding their relationships and context, generating natural and descriptive language, and ensuring the captions are relevant and useful. Near real-time processing is also required<br>in order to facilitate agile forensic decision making and prompt triage, hand-over and reduction of the amount of data to be processed by investigators or subsequent processing tools. This paper presents a captioning-driven efficient video analytic which is able to extract accurate descriptions of images and videos files. The proposed<br>approach includes a temporal segmentation technique providing the most relevant frames. Subsequently, an image captioning approach has been specialized to describe visual media related to counter-terrorism and cybercrime for each relevant frame. Our proposed method achieves high consistency and correlation with human summary on SumMe dataset, outperforming previous similar methods.</p>
format	Recurso digital
id	zenodo_https___doi_org_10_1117_12_2680257
institution	Zenodo
language
publishDate	2023
publisher	Zenodo
record_format	zenodo
spellingShingle	Efficient visual information indexation for supporting actionable intelligence and knowledge generation Nurye, Desale Fentaw García, Jorge Aktas, Yagmur <p>Visual indexing, or the ability to search and analyze visual media such as images and videos, is important for law enforcement agencies because it can speed up criminal investigations. As more and more visual media is created and shared online, the ability to effectively search and analyze this data becomes increasingly important for<br>investigators to do their job effectively. The major challenges for video captioning include accurately recognizing the objects and activities in the image, understanding their relationships and context, generating natural and descriptive language, and ensuring the captions are relevant and useful. Near real-time processing is also required<br>in order to facilitate agile forensic decision making and prompt triage, hand-over and reduction of the amount of data to be processed by investigators or subsequent processing tools. This paper presents a captioning-driven efficient video analytic which is able to extract accurate descriptions of images and videos files. The proposed<br>approach includes a temporal segmentation technique providing the most relevant frames. Subsequently, an image captioning approach has been specialized to describe visual media related to counter-terrorism and cybercrime for each relevant frame. Our proposed method achieves high consistency and correlation with human summary on SumMe dataset, outperforming previous similar methods.</p>
title	Efficient visual information indexation for supporting actionable intelligence and knowledge generation
url	https://doi.org/10.1117/12.2680257

Similar Items