TY - GEN
T1 - Speeding up the Xbox recommender system using a euclidean transformation for inner-product spaces
AU - Bachrach, Yoram
AU - Finkelstein, Yehuda
AU - Gilad-Bachrach, Ran
AU - Katzir, Liran
AU - Koenigstein, Noam
AU - Nice, Nir
AU - Paquet, Ulrich
N1 - Publisher Copyright:
Copyright © 2014 ACM.
PY - 2014/10/6
Y1 - 2014/10/6
N2 - A prominent approach in collaborative filtering based recommender systems is using dimensionality reduction (matrix factorization) techniques to map users and items into low-dimensional vectors. In such systems, a higher inner product between a user vector and an item vector indicates that the item better suits the user's preference. Traditionally, retrieving the most suitable items is done by scoring and sorting all items. Real world online recommender systems must adhere to strict response-time constraints, so when the number of items is large, scoring all items is intractable. We propose a novel order preserving transformation, mapping the maximum inner product search problem to Euclidean space nearest neighbor search problem. Utilizing this transformation, we study the efficiency of several (approximate) nearest neighbor data structures. Our final solution is based on a novel use of the PCA-Tree data structure in which results are augmented using paths one hamming distance away from the query (neighborhood boosting). The end result is a system which allows approximate matches (items with relatively high inner product, but not necessarily the highest one). We evaluate our techniques on two large-scale recommendation datasets, Xbox Movies and Yahoo Music, and show that this technique allows trading off a slight degradation in the recommendation quality for a significant improvement in the retrieval time.
AB - A prominent approach in collaborative filtering based recommender systems is using dimensionality reduction (matrix factorization) techniques to map users and items into low-dimensional vectors. In such systems, a higher inner product between a user vector and an item vector indicates that the item better suits the user's preference. Traditionally, retrieving the most suitable items is done by scoring and sorting all items. Real world online recommender systems must adhere to strict response-time constraints, so when the number of items is large, scoring all items is intractable. We propose a novel order preserving transformation, mapping the maximum inner product search problem to Euclidean space nearest neighbor search problem. Utilizing this transformation, we study the efficiency of several (approximate) nearest neighbor data structures. Our final solution is based on a novel use of the PCA-Tree data structure in which results are augmented using paths one hamming distance away from the query (neighborhood boosting). The end result is a system which allows approximate matches (items with relatively high inner product, but not necessarily the highest one). We evaluate our techniques on two large-scale recommendation datasets, Xbox Movies and Yahoo Music, and show that this technique allows trading off a slight degradation in the recommendation quality for a significant improvement in the retrieval time.
KW - Fast retrieval
KW - Inner product search
KW - Matrix factorization
KW - Recommender systems
UR - http://www.scopus.com/inward/record.url?scp=84908888240&partnerID=8YFLogxK
U2 - 10.1145/2645710.2645741
DO - 10.1145/2645710.2645741
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84908888240
T3 - RecSys 2014 - Proceedings of the 8th ACM Conference on Recommender Systems
SP - 257
EP - 264
BT - RecSys 2014 - Proceedings of the 8th ACM Conference on Recommender Systems
PB - Association for Computing Machinery
T2 - 8th ACM Conference on Recommender Systems, RecSys 2014
Y2 - 6 October 2014 through 10 October 2014
ER -