Cost Function Unrolling in Unsupervised Optical Flow

Gal Lifshitz*, Dan Raviv

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Steepest descent algorithms, which are commonly used in deep learning, use the gradient as the descent direction, either as-is or after a direction shift using preconditioning. In many scenarios calculating the gradient is numerically hard due to complex or non-differentiable cost functions, specifically next to singular points. This has been commonly overcome by increased DNN model sizes and complexity. In this work we propose a novel mechanism we refer to as Cost Unrolling, for improving the ability of a given DNN model to solve a complex cost function, without modifying its architecture or increasing computational complexity. We focus on the derivation of the Total Variation (TV) smoothness constraint commonly used in unsupervised cost functions. We introduce an iterative differentiable alternative to the TV smoothness constraint, which is demonstrated to produce more stable gradients during training, enable faster convergence and improve the predictions of a given DNN model. We test our method in several tasks, including image denoising and unsupervised optical flow. Replacing the TV smoothness constraint with our loss during DNN training, we report improved results in all tested scenarios. Specifically, our method improves flows predicted at occluded regions, a crucial task by itself, resulting in sharper motion boundaries.

Original languageEnglish
Pages (from-to)869-880
Number of pages12
JournalIEEE Transactions on Pattern Analysis and Machine Intelligence
Volume46
Issue number2
DOIs
StatePublished - 1 Feb 2024

Funding

FundersFunder number
Yitzhak and Chaya Weinstein Research Institute for Signal Processing, Tel Aviv University

    Keywords

    • Optical flow
    • optimization
    • total variation
    • unsupervised learning

    Fingerprint

    Dive into the research topics of 'Cost Function Unrolling in Unsupervised Optical Flow'. Together they form a unique fingerprint.

    Cite this