Saved in:
Bibliographic Details
Main Authors: Lifshitz, Gal, Raviv, Dan
Format: Preprint
Published: 2020
Subjects:
Online Access:https://arxiv.org/abs/2011.14814
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914811927855104
author Lifshitz, Gal
Raviv, Dan
author_facet Lifshitz, Gal
Raviv, Dan
contents Steepest descent algorithms, which are commonly used in deep learning, use the gradient as the descent direction, either as-is or after a direction shift using preconditioning. In many scenarios calculating the gradient is numerically hard due to complex or non-differentiable cost functions, specifically next to singular points. In this work we focus on the derivation of the Total Variation semi-norm commonly used in unsupervised cost functions. Specifically, we derive a differentiable proxy to the hard L1 smoothness constraint in a novel iterative scheme which we refer to as Cost Unrolling. Producing more accurate gradients during training, our method enables finer predictions of a given DNN model through improved convergence, without modifying its architecture or increasing computational complexity. We demonstrate our method in the unsupervised optical flow task. Replacing the L1 smoothness constraint with our unrolled cost during the training of a well known baseline, we report improved results on both MPI Sintel and KITTI 2015 unsupervised optical flow benchmarks. Particularly, we report EPE reduced by up to 15.82% on occluded pixels, where the smoothness constraint is dominant, enabling the detection of much sharper motion edges.
format Preprint
id arxiv_https___arxiv_org_abs_2011_14814
institution arXiv
publishDate 2020
record_format arxiv
spellingShingle Cost Function Unrolling in Unsupervised Optical Flow
Lifshitz, Gal
Raviv, Dan
Computer Vision and Pattern Recognition
Machine Learning
Steepest descent algorithms, which are commonly used in deep learning, use the gradient as the descent direction, either as-is or after a direction shift using preconditioning. In many scenarios calculating the gradient is numerically hard due to complex or non-differentiable cost functions, specifically next to singular points. In this work we focus on the derivation of the Total Variation semi-norm commonly used in unsupervised cost functions. Specifically, we derive a differentiable proxy to the hard L1 smoothness constraint in a novel iterative scheme which we refer to as Cost Unrolling. Producing more accurate gradients during training, our method enables finer predictions of a given DNN model through improved convergence, without modifying its architecture or increasing computational complexity. We demonstrate our method in the unsupervised optical flow task. Replacing the L1 smoothness constraint with our unrolled cost during the training of a well known baseline, we report improved results on both MPI Sintel and KITTI 2015 unsupervised optical flow benchmarks. Particularly, we report EPE reduced by up to 15.82% on occluded pixels, where the smoothness constraint is dominant, enabling the detection of much sharper motion edges.
title Cost Function Unrolling in Unsupervised Optical Flow
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2011.14814