TY - JOUR
T1 - Gaussian bandwidth selection for manifold learning and classification
AU - Lindenbaum, Ofir
AU - Salhov, Moshe
AU - Yeredor, Arie
AU - Averbuch, Amir
N1 - Publisher Copyright:
© 2020, The Author(s), under exclusive licence to Springer Science+Business Media LLC, part of Springer Nature.
PY - 2020/11/1
Y1 - 2020/11/1
N2 - Kernel methods play a critical role in many machine learning algorithms. They are useful in manifold learning, classification, clustering and other data analysis tasks. Setting the kernel’s scale parameter, also referred to as the kernel’s bandwidth, highly affects the performance of the task in hand. We propose to set a scale parameter that is tailored to one of two types of tasks: classification and manifold learning. For manifold learning, we seek a scale which is best at capturing the manifold’s intrinsic dimension. For classification, we propose three methods for estimating the scale, which optimize the classification results in different senses. The proposed frameworks are simulated on artificial and on real datasets. The results show a high correlation between optimal classification rates and the estimated scales. Finally, we demonstrate the approach on a seismic event classification task.
AB - Kernel methods play a critical role in many machine learning algorithms. They are useful in manifold learning, classification, clustering and other data analysis tasks. Setting the kernel’s scale parameter, also referred to as the kernel’s bandwidth, highly affects the performance of the task in hand. We propose to set a scale parameter that is tailored to one of two types of tasks: classification and manifold learning. For manifold learning, we seek a scale which is best at capturing the manifold’s intrinsic dimension. For classification, we propose three methods for estimating the scale, which optimize the classification results in different senses. The proposed frameworks are simulated on artificial and on real datasets. The results show a high correlation between optimal classification rates and the estimated scales. Finally, we demonstrate the approach on a seismic event classification task.
KW - Classification
KW - Diffusion maps
KW - Dimensionality reduction
KW - Kernel methods
UR - http://www.scopus.com/inward/record.url?scp=85087463774&partnerID=8YFLogxK
U2 - 10.1007/s10618-020-00692-x
DO - 10.1007/s10618-020-00692-x
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 32837252
AN - SCOPUS:85087463774
SN - 1384-5810
VL - 34
SP - 1676
EP - 1712
JO - Data Mining and Knowledge Discovery
JF - Data Mining and Knowledge Discovery
IS - 6
ER -