TY - GEN
T1 - Ultra-fast similarity search using ternary content addressable memory
AU - Bremler-Barr, Anat
AU - Harchol, Yotam
AU - Hay, David
AU - Hel-Or, Yacov
N1 - Publisher Copyright:
Copyright 2015 ACM.
PY - 2015/5/31
Y1 - 2015/5/31
N2 - Similarity search, and specifically the nearest-neighbor search (NN) problem is widely used in many fields of computer science such as machine learning, computer vision and databases. However, in many settings such searches are known to suffer from the notorious curse of dimensionality, where running time grows exponentially with d. This causes severe performance degradation when working in high-dimensional spaces. Approximate techniques such as locality-sensitive hashing [2] improve the performance of the search, but are still computationally intensive. In this paper we propose a new way to solve this problem using a special hardware device called ternary content addressable memory (TCAM). TCAM is an associative memory, which is a special type of computer memory that is widely used in switches and routers for very high speed search applications. We show that the TCAM computational model can be leveraged and adjusted to solve NN search problems in a single TCAM lookup cycle, and with linear space. This concept does not suffer from the curse of dimensionality and is shown to improve the best known approaches for NN by more than four orders of magnitude. Simulation results demonstrate dramatic improvement over the best known approaches for NN, and suggest that TCAM devices may play a critical role in future large-scale databases and cloud applications.
AB - Similarity search, and specifically the nearest-neighbor search (NN) problem is widely used in many fields of computer science such as machine learning, computer vision and databases. However, in many settings such searches are known to suffer from the notorious curse of dimensionality, where running time grows exponentially with d. This causes severe performance degradation when working in high-dimensional spaces. Approximate techniques such as locality-sensitive hashing [2] improve the performance of the search, but are still computationally intensive. In this paper we propose a new way to solve this problem using a special hardware device called ternary content addressable memory (TCAM). TCAM is an associative memory, which is a special type of computer memory that is widely used in switches and routers for very high speed search applications. We show that the TCAM computational model can be leveraged and adjusted to solve NN search problems in a single TCAM lookup cycle, and with linear space. This concept does not suffer from the curse of dimensionality and is shown to improve the best known approaches for NN by more than four orders of magnitude. Simulation results demonstrate dramatic improvement over the best known approaches for NN, and suggest that TCAM devices may play a critical role in future large-scale databases and cloud applications.
UR - https://www.scopus.com/pages/publications/84959881033
U2 - 10.1145/2771937.2771938
DO - 10.1145/2771937.2771938
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84959881033
T3 - 11th International Workshop on Data Management on New Hardware, DaMoN 2015 - In conjunction with the ACM SIGMOD/PODS Conference
BT - 11th International Workshop on Data Management on New Hardware, DaMoN 2015 - In conjunction with the ACM SIGMOD/PODS Conference
A2 - Kersten, Martin
A2 - Pandis, Ippokratis
PB - Association for Computing Machinery, Inc
T2 - 11th International Workshop on Data Management on New Hardware, DaMoN 2015
Y2 - 1 June 2015
ER -