Approximately-isometric diffusion maps

Moshe Salhov, Amit Bermanis, Guy Wolf, Amir Averbuch*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Diffusion Maps (DM), and other kernel methods, are utilized for the analysis of high dimensional datasets. The DM method uses a Markovian diffusion process to model and analyze data. A spectral analysis of the DM kernel yields a map of the data into a low dimensional space, where Euclidean distances between the mapped data points represent the diffusion distances between the corresponding high dimensional data points. Many machine learning methods, which are based on the Euclidean metric, can be applied to the mapped data points in order to take advantage of the diffusion relations between them. However, a significant drawback of the DM is the need to apply spectral decomposition to a kernel matrix, which becomes infeasible for large datasets. In this paper, we present an efficient approximation of the DM embedding. The presented approximation algorithm produces a dictionary of data points by identifying a small set of informative representatives. Then, based on this dictionary, the entire dataset is efficiently embedded into a low dimensional space. The Euclidean distances in the resulting embedded space approximate the diffusion distances. The properties of the presented embedding and its relation to DM method are analyzed and demonstrated.

Original languageEnglish
Pages (from-to)399-419
Number of pages21
JournalApplied and Computational Harmonic Analysis
Volume38
Issue number3
DOIs
StatePublished - 1 May 2015

Funding

FundersFunder number
United States-Israel Binational Science FoundationBSF 2012282
Israel Science Foundation1041/10
Jyväskylän Yliopisto
Ministry of Science and Technology, Israel3-9096, 3-10898

    Keywords

    • Diffusion distance
    • Diffusion maps
    • Dimensionality reduction
    • Distance preservation
    • Kernel PCA
    • Manifold learning

    Fingerprint

    Dive into the research topics of 'Approximately-isometric diffusion maps'. Together they form a unique fingerprint.

    Cite this