Saved in:
Bibliographic Details
Main Authors: Wang, Yuhang, Li, Hai, Hou, Shujuan, Dong, Zhetao, Yang, Xiaoyao
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.07694
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910045319462912
author Wang, Yuhang
Li, Hai
Hou, Shujuan
Dong, Zhetao
Yang, Xiaoyao
author_facet Wang, Yuhang
Li, Hai
Hou, Shujuan
Dong, Zhetao
Yang, Xiaoyao
contents In bandwidth-limited online video streaming, videos are usually downsampled and compressed. Although recent online video super-resolution (online VSR) approaches achieve promising results, they are still compute-intensive and fall short of real-time processing at higher resolutions, due to complex motion estimation for alignment and redundant processing of consecutive frames. To address these issues, we propose a compressed-domain-aware network (CDA-VSR) for online VSR, which utilizes compressed-domain information, including motion vectors, residual maps, and frame types to balance quality and efficiency. Specifically, we propose a motion-vector-guided deformable alignment module that uses motion vectors for coarse warping and learns only local residual offsets for fine-tuned adjustments, thereby maintaining accuracy while reducing computation. Then, we utilize a residual map gated fusion module to derive spatial weights from residual maps, suppressing mismatched regions and emphasizing reliable details. Further, we design a frame-type-aware reconstruction module for adaptive compute allocation across frame types, balancing accuracy and efficiency. On the REDS4 dataset, our CDA-VSR surpasses the state-of-the-art method TMP, with a maximum PSNR improvement of 0.13 dB while delivering more than double the inference speed. The code will be released at https://github.com/sspBIT/CDA-VSR.
format Preprint
id arxiv_https___arxiv_org_abs_2603_07694
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Compressed-Domain-Aware Online Video Super-Resolution
Wang, Yuhang
Li, Hai
Hou, Shujuan
Dong, Zhetao
Yang, Xiaoyao
Computer Vision and Pattern Recognition
Artificial Intelligence
In bandwidth-limited online video streaming, videos are usually downsampled and compressed. Although recent online video super-resolution (online VSR) approaches achieve promising results, they are still compute-intensive and fall short of real-time processing at higher resolutions, due to complex motion estimation for alignment and redundant processing of consecutive frames. To address these issues, we propose a compressed-domain-aware network (CDA-VSR) for online VSR, which utilizes compressed-domain information, including motion vectors, residual maps, and frame types to balance quality and efficiency. Specifically, we propose a motion-vector-guided deformable alignment module that uses motion vectors for coarse warping and learns only local residual offsets for fine-tuned adjustments, thereby maintaining accuracy while reducing computation. Then, we utilize a residual map gated fusion module to derive spatial weights from residual maps, suppressing mismatched regions and emphasizing reliable details. Further, we design a frame-type-aware reconstruction module for adaptive compute allocation across frame types, balancing accuracy and efficiency. On the REDS4 dataset, our CDA-VSR surpasses the state-of-the-art method TMP, with a maximum PSNR improvement of 0.13 dB while delivering more than double the inference speed. The code will be released at https://github.com/sspBIT/CDA-VSR.
title Compressed-Domain-Aware Online Video Super-Resolution
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2603.07694