Saved in:
Bibliographic Details
Main Authors: Suluhan, H. Umut, Doruk, Abdullah Enes, Ates, Hasan F., Gunturk, Bahadir K.
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2310.12092
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917770035200000
author Suluhan, H. Umut
Doruk, Abdullah Enes
Ates, Hasan F.
Gunturk, Bahadir K.
author_facet Suluhan, H. Umut
Doruk, Abdullah Enes
Ates, Hasan F.
Gunturk, Bahadir K.
contents High-spatio-temporal resolution (HSTR) video recording plays a crucial role in enhancing various imagery tasks that require fine-detailed information. State-of-the-art cameras provide this required high frame-rate and high spatial resolution together, albeit at a high cost. To alleviate this issue, this paper proposes a dual camera system for the generation of HSTR video using reference-based super-resolution (RefSR). One camera captures high spatial resolution low frame rate (HSLF) video while the other captures low spatial resolution high frame rate (LSHF) video simultaneously for the same scene. A novel deep learning architecture is proposed to fuse HSLF and LSHF video feeds and synthesize HSTR video frames. The proposed model combines optical flow estimation and (channel-wise and spatial) attention mechanisms to capture the fine motion and complex dependencies between frames of the two video feeds. Simulations show that the proposed model provides significant improvement over existing reference-based SR techniques in terms of PSNR and SSIM metrics. The method also exhibits sufficient frames per second (FPS) for aerial monitoring when deployed on a power-constrained drone equipped with dual cameras.
format Preprint
id arxiv_https___arxiv_org_abs_2310_12092
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle HSTR-Net: Reference Based Video Super-resolution with Dual Cameras
Suluhan, H. Umut
Doruk, Abdullah Enes
Ates, Hasan F.
Gunturk, Bahadir K.
Computer Vision and Pattern Recognition
High-spatio-temporal resolution (HSTR) video recording plays a crucial role in enhancing various imagery tasks that require fine-detailed information. State-of-the-art cameras provide this required high frame-rate and high spatial resolution together, albeit at a high cost. To alleviate this issue, this paper proposes a dual camera system for the generation of HSTR video using reference-based super-resolution (RefSR). One camera captures high spatial resolution low frame rate (HSLF) video while the other captures low spatial resolution high frame rate (LSHF) video simultaneously for the same scene. A novel deep learning architecture is proposed to fuse HSLF and LSHF video feeds and synthesize HSTR video frames. The proposed model combines optical flow estimation and (channel-wise and spatial) attention mechanisms to capture the fine motion and complex dependencies between frames of the two video feeds. Simulations show that the proposed model provides significant improvement over existing reference-based SR techniques in terms of PSNR and SSIM metrics. The method also exhibits sufficient frames per second (FPS) for aerial monitoring when deployed on a power-constrained drone equipped with dual cameras.
title HSTR-Net: Reference Based Video Super-resolution with Dual Cameras
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2310.12092