TY - JOUR
T1 - LDAHash
T2 - Improved matching with smaller descriptors
AU - Strecha, Christoph
AU - Bronstein, Alexander M.
AU - Bronstein, Michael M.
AU - Fua, Pascal
N1 - Funding Information:
The authors would like to thank Matthew Brown for useful discussion and Simon Winder for providing his DAISY binary and for testing DAISY on the fountain sequence. They further acknowledge Flickr users for providing their images for the Venice and for parts of the Prague and Lausanne data sets as well as SenseFly (www.sensefly.com) for capturing the aerial image of the EPFL. Michael M. Bronstein is partially supported by the Swiss High-Performance and High-Productivity Computing (HP2C).
PY - 2012
Y1 - 2012
N2 - SIFT-like local feature descriptors are ubiquitously employed in computer vision applications such as content-based retrieval, video analysis, copy detection, object recognition, photo tourism, and 3D reconstruction. Feature descriptors can be designed to be invariant to certain classes of photometric and geometric transformations, in particular, affine and intensity scale transformations. However, real transformations that an image can undergo can only be approximately modeled in this way, and thus most descriptors are only approximately invariant in practice. Second, descriptors are usually high dimensional (e.g., SIFT is represented as a 128-dimensional vector). In large-scale retrieval and matching problems, this can pose challenges in storing and retrieving descriptor data. We map the descriptor vectors into the Hamming space in which the Hamming metric is used to compare the resulting representations. This way, we reduce the size of the descriptors by representing them as short binary strings and learn descriptor invariance from examples. We show extensive experimental validation, demonstrating the advantage of the proposed approach.
AB - SIFT-like local feature descriptors are ubiquitously employed in computer vision applications such as content-based retrieval, video analysis, copy detection, object recognition, photo tourism, and 3D reconstruction. Feature descriptors can be designed to be invariant to certain classes of photometric and geometric transformations, in particular, affine and intensity scale transformations. However, real transformations that an image can undergo can only be approximately modeled in this way, and thus most descriptors are only approximately invariant in practice. Second, descriptors are usually high dimensional (e.g., SIFT is represented as a 128-dimensional vector). In large-scale retrieval and matching problems, this can pose challenges in storing and retrieving descriptor data. We map the descriptor vectors into the Hamming space in which the Hamming metric is used to compare the resulting representations. This way, we reduce the size of the descriptors by representing them as short binary strings and learn descriptor invariance from examples. We show extensive experimental validation, demonstrating the advantage of the proposed approach.
KW - 3D reconstruction
KW - DAISY
KW - Local features
KW - SIFT
KW - binarization
KW - matching
KW - metric learning
KW - similarity-sensitive hashing
UR - http://www.scopus.com/inward/record.url?scp=81855191888&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2011.103
DO - 10.1109/TPAMI.2011.103
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:81855191888
VL - 34
SP - 66
EP - 78
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
SN - 0162-8828
IS - 1
M1 - 5770264
ER -