TY - JOUR
T1 - Unsupervised natural image patch learning
AU - Danon, Dov
AU - Averbuch-Elor, Hadar
AU - Fried, Ohad
AU - Cohen-Or, Daniel
N1 - Publisher Copyright:
© 2019, The Author(s).
PY - 2019/9/1
Y1 - 2019/9/1
N2 - A metric for natural image patches is an important tool for analyzing images. An efficient means of learning one is to train a deep network to map an image patch to a vector space, in which the Euclidean distance reflects patch similarity. Previous attempts learned such an embedding in a supervised manner, requiring the availability of many annotated images. In this paper, we present an unsupervised embedding of natural image patches, avoiding the need for annotated images. The key idea is that the similarity of two patches can be learned from the prevalence of their spatial proximity in natural images. Clearly, relying on this simple principle, many spatially nearby pairs are outliers. However, as we show, these outliers do not harm the convergence of the metric learning. We show that our unsupervised embedding approach is more effective than a supervised one or one that uses deep patch representations. Moreover, we show that it naturally lends itself to an efficient self-supervised domain adaptation technique onto a target domain that contains a common foreground object.
AB - A metric for natural image patches is an important tool for analyzing images. An efficient means of learning one is to train a deep network to map an image patch to a vector space, in which the Euclidean distance reflects patch similarity. Previous attempts learned such an embedding in a supervised manner, requiring the availability of many annotated images. In this paper, we present an unsupervised embedding of natural image patches, avoiding the need for annotated images. The key idea is that the similarity of two patches can be learned from the prevalence of their spatial proximity in natural images. Clearly, relying on this simple principle, many spatially nearby pairs are outliers. However, as we show, these outliers do not harm the convergence of the metric learning. We show that our unsupervised embedding approach is more effective than a supervised one or one that uses deep patch representations. Moreover, we show that it naturally lends itself to an efficient self-supervised domain adaptation technique onto a target domain that contains a common foreground object.
KW - metric learning
KW - unsupervised learning
UR - http://www.scopus.com/inward/record.url?scp=85070959249&partnerID=8YFLogxK
U2 - 10.1007/s41095-019-0147-y
DO - 10.1007/s41095-019-0147-y
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85070959249
SN - 2096-0433
VL - 5
SP - 229
EP - 237
JO - Computational Visual Media
JF - Computational Visual Media
IS - 3
ER -