Staff View: CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation :: Library Catalog

Bibliographic Details
Main Authors:	Silver, Daniel; Kimmel, Ron
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Machine Learning
Online Access:	https://arxiv.org/abs/2501.00975

_version_	1866916547569647616
author	Silver, Daniel; Kimmel, Ron
author_facet	Silver, Daniel; Kimmel, Ron
contents	In the field of video compression, the pursuit of better quality at lower bit rates remains a long-standing goal. Recent developments have demonstrated the potential of Implicit Neural Representation (INR) as a promising alternative to traditional transform-based methodologies. Video INRs can be roughly divided into frame-wise and pixel-wise methods according to the structure the network outputs. While pixel-wise methods are better suited to upsampling and parallelization, frame-wise methods have demonstrated better performance. We introduce CoordFlow, a novel pixel-wise INR for video compression. It yields state-of-the-art results compared to other pixel-wise INRs and on-par performance compared to leading frame-wise techniques. The method is based on separating the visual information into visually consistent layers, each represented by a dedicated network that compensates for the layer's motion. When the layers are integrated, a byproduct is an unsupervised segmentation of the video sequence. Object motion trajectories are implicitly utilized to compensate for visual-temporal redundancies. Additionally, the proposed method provides inherent video upsampling, stabilization, inpainting, and denoising capabilities.
format	Preprint
id	arxiv_https___arxiv_org_abs_2501_00975
institution	arXiv
publishDate	2025
record_format	arxiv
title	CoordFlow: Coordinate Flow for Pixel-wise Neural Video Representation
topic	Computer Vision and Pattern Recognition; Machine Learning
url	https://arxiv.org/abs/2501.00975

Similar Items