Bibliographic Details
Main authors: Morimitsu, Henrique; Zhu, Xiaobin; Cesar Jr., Roberto M.; Ji, Xiangyang; Yin, Xu-Cheng
Format: Preprint
Published: 2025
Subjects: Computer Vision and Pattern Recognition
Online access: https://arxiv.org/abs/2503.14880
_version_ 1866918150205865984
author Morimitsu, Henrique
Zhu, Xiaobin
Cesar Jr., Roberto M.
Ji, Xiangyang
Yin, Xu-Cheng
contents Optical flow estimation is essential for video processing tasks, such as restoration and action recognition. The quality of videos is constantly increasing, with current standards reaching 8K resolution. However, optical flow methods are usually designed for low resolution and do not generalize to large inputs due to their rigid architectures. They adopt downscaling or input tiling to reduce the input size, causing a loss of details and global information. There is also a lack of optical flow benchmarks to judge the actual performance of existing methods on high-resolution samples. Previous works only conducted qualitative high-resolution evaluations on hand-picked samples. This paper fills this gap in optical flow estimation in two ways. We propose DPFlow, an adaptive optical flow architecture capable of generalizing up to 8K resolution inputs while trained with only low-resolution samples. We also introduce Kubric-NK, a new benchmark for evaluating optical flow methods with input resolutions ranging from 1K to 8K. Our high-resolution evaluation pushes the boundaries of existing methods and reveals new insights about their generalization capabilities. Extensive experimental results show that DPFlow achieves state-of-the-art results on the MPI-Sintel, KITTI 2015, Spring, and other high-resolution benchmarks.
format Preprint
id arxiv_https___arxiv_org_abs_2503_14880
institution arXiv
publishDate 2025
record_format arxiv
title DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid Framework
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2503.14880