TY - GEN
T1 - Scalable nearest neighbor search for optimal transport
AU - Backurs, Arturs
AU - Dong, Yihe
AU - Indyk, Piotr
AU - Razenshteyn, Ilya
AU - Wagner, Tal
N1 - Publisher Copyright: © ICML 2020. All rights reserved.
PY - 2020
Y1 - 2020
N2 - The Optimal Transport (a.k.a. Wasserstein) distance is an increasingly popular similarity measure for rich data domains, such as images or text documents. This raises the necessity for fast nearest neighbor search algorithms according to this distance, which poses a substantial computational bottleneck on massive datasets. In this work we introduce Flowtree, a fast and accurate approximation algorithm for the Wasserstein-1 distance. We formally analyze its approximation factor and running time. We perform extensive experimental evaluation of nearest neighbor search algorithms in the W1 distance on real-world datasets. Our results show that compared to previous state of the art, Flowtree achieves up to 7.4 times faster running time.
AB - The Optimal Transport (a.k.a. Wasserstein) distance is an increasingly popular similarity measure for rich data domains, such as images or text documents. This raises the necessity for fast nearest neighbor search algorithms according to this distance, which poses a substantial computational bottleneck on massive datasets. In this work we introduce Flowtree, a fast and accurate approximation algorithm for the Wasserstein-1 distance. We formally analyze its approximation factor and running time. We perform extensive experimental evaluation of nearest neighbor search algorithms in the W1 distance on real-world datasets. Our results show that compared to previous state of the art, Flowtree achieves up to 7.4 times faster running time.
UR - http://www.scopus.com/inward/record.url?scp=85101696155&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85101696155
T3 - 37th International Conference on Machine Learning, ICML 2020
SP - 474
EP - 483
BT - 37th International Conference on Machine Learning, ICML 2020
A2 - Daume, Hal
A2 - Singh, Aarti
PB - International Machine Learning Society (IMLS)
T2 - 37th International Conference on Machine Learning, ICML 2020
Y2 - 13 July 2020 through 18 July 2020
ER -