Saved in:
Bibliographic Details
Main Authors: Gao, Zong-Lin, NguyenQuang, Sang, Peng, Wen-Hsiao, HoangVan, Xiem
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.12816
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917594242482176
author Gao, Zong-Lin
NguyenQuang, Sang
Peng, Wen-Hsiao
HoangVan, Xiem
author_facet Gao, Zong-Lin
NguyenQuang, Sang
Peng, Wen-Hsiao
HoangVan, Xiem
contents Learned hierarchical B-frame coding aims to leverage bi-directional reference frames for better coding efficiency. However, the domain shift between training and test scenarios due to dataset limitations poses a challenge. This issue arises from training the codec with small groups of pictures (GOP) but testing it on large GOPs. Specifically, the motion estimation network, when trained on small GOPs, is unable to handle large motion at test time, incurring a negative impact on compression performance. To mitigate the domain shift, we present an online motion resolution adaptation (OMRA) method. It adapts the spatial resolution of video frames on a per-frame basis to suit the capability of the motion estimation network in a pre-trained B-frame codec. Our OMRA is an online, inference technique. It need not re-train the codec and is readily applicable to existing B-frame codecs that adopt hierarchical bi-directional prediction. Experimental results show that OMRA significantly enhances the compression performance of two state-of-the-art learned B-frame codecs on commonly used datasets.
format Preprint
id arxiv_https___arxiv_org_abs_2402_12816
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle OMRA: Online Motion Resolution Adaptation to Remedy Domain Shift in Learned Hierarchical B-frame Coding
Gao, Zong-Lin
NguyenQuang, Sang
Peng, Wen-Hsiao
HoangVan, Xiem
Image and Video Processing
Learned hierarchical B-frame coding aims to leverage bi-directional reference frames for better coding efficiency. However, the domain shift between training and test scenarios due to dataset limitations poses a challenge. This issue arises from training the codec with small groups of pictures (GOP) but testing it on large GOPs. Specifically, the motion estimation network, when trained on small GOPs, is unable to handle large motion at test time, incurring a negative impact on compression performance. To mitigate the domain shift, we present an online motion resolution adaptation (OMRA) method. It adapts the spatial resolution of video frames on a per-frame basis to suit the capability of the motion estimation network in a pre-trained B-frame codec. Our OMRA is an online, inference technique. It need not re-train the codec and is readily applicable to existing B-frame codecs that adopt hierarchical bi-directional prediction. Experimental results show that OMRA significantly enhances the compression performance of two state-of-the-art learned B-frame codecs on commonly used datasets.
title OMRA: Online Motion Resolution Adaptation to Remedy Domain Shift in Learned Hierarchical B-frame Coding
topic Image and Video Processing
url https://arxiv.org/abs/2402.12816