Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yu, Li, Li, Zhihui, Yao, Chao, Xiao, Jimin, Gabbouj, Moncef
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2504.06755
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915285596897280
author	Yu, Li Li, Zhihui Yao, Chao Xiao, Jimin Gabbouj, Moncef
author_facet	Yu, Li Li, Zhihui Yao, Chao Xiao, Jimin Gabbouj, Moncef
contents	Neural representations for video (NeRV) have gained considerable attention for their strong performance across various video tasks. However, existing NeRV methods often struggle to capture fine spatial details, resulting in vague reconstructions. In this paper, we present a Frequency Separation and Augmentation based Neural Representation for video (FANeRV), which addresses these limitations with its core Wavelet Frequency Upgrade Block. This block explicitly separates input frames into high and low-frequency components using discrete wavelet transform, followed by targeted enhancement using specialized modules. Finally, a specially designed gated network effectively fuses these frequency components for optimal reconstruction. Additionally, convolutional residual enhancement blocks are integrated into the later stages of the network to balance parameter distribution and improve the restoration of high-frequency details. Experimental results demonstrate that FANeRV significantly improves reconstruction performance and excels in multiple tasks, including video compression, inpainting, and interpolation, outperforming existing NeRV methods.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_06755
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	FANeRV: Frequency Separation and Augmentation based Neural Representation for Video Yu, Li Li, Zhihui Yao, Chao Xiao, Jimin Gabbouj, Moncef Computer Vision and Pattern Recognition Neural representations for video (NeRV) have gained considerable attention for their strong performance across various video tasks. However, existing NeRV methods often struggle to capture fine spatial details, resulting in vague reconstructions. In this paper, we present a Frequency Separation and Augmentation based Neural Representation for video (FANeRV), which addresses these limitations with its core Wavelet Frequency Upgrade Block. This block explicitly separates input frames into high and low-frequency components using discrete wavelet transform, followed by targeted enhancement using specialized modules. Finally, a specially designed gated network effectively fuses these frequency components for optimal reconstruction. Additionally, convolutional residual enhancement blocks are integrated into the later stages of the network to balance parameter distribution and improve the restoration of high-frequency details. Experimental results demonstrate that FANeRV significantly improves reconstruction performance and excels in multiple tasks, including video compression, inpainting, and interpolation, outperforming existing NeRV methods.
title	FANeRV: Frequency Separation and Augmentation based Neural Representation for Video
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2504.06755

Similar Items