TY - JOUR
T1 - FriendlyCore
T2 - 39th International Conference on Machine Learning, ICML 2022
AU - Tsfadia, Eliad
AU - Cohen, Edith
AU - Kaplan, Haim
AU - Mansour, Yishay
AU - Stemmer, Uri
N1 - Publisher Copyright:
Copyright © 2022 by the author(s)
PY - 2022
Y1 - 2022
N2 - Differentially private algorithms for common metric aggregation tasks, such as clustering or averaging, often have limited practicality due to their complexity or to the large number of data points that is required for accurate results. We propose a simple and practical tool, FriendlyCore, that takes a set of points D from an unrestricted (pseudo) metric space as input. When D has effective diameter r, FriendlyCore returns a “stable” subset C ⊆ D that includes all points, except possibly a few outliers, and is guaranteed to have diameter r. FriendlyCore can be used to preprocess the input before privately aggregating it, potentially simplifying the aggregation or boosting its accuracy. Surprisingly, FriendlyCore is light-weight with no dependence on the dimension. We empirically demonstrate its advantages in boosting the accuracy of mean estimation and clustering tasks such as k-means and k-GMM, outperforming tailored methods.
AB - Differentially private algorithms for common metric aggregation tasks, such as clustering or averaging, often have limited practicality due to their complexity or to the large number of data points that is required for accurate results. We propose a simple and practical tool, FriendlyCore, that takes a set of points D from an unrestricted (pseudo) metric space as input. When D has effective diameter r, FriendlyCore returns a “stable” subset C ⊆ D that includes all points, except possibly a few outliers, and is guaranteed to have diameter r. FriendlyCore can be used to preprocess the input before privately aggregating it, potentially simplifying the aggregation or boosting its accuracy. Surprisingly, FriendlyCore is light-weight with no dependence on the dimension. We empirically demonstrate its advantages in boosting the accuracy of mean estimation and clustering tasks such as k-means and k-GMM, outperforming tailored methods.
UR - http://www.scopus.com/inward/record.url?scp=85163079076&partnerID=8YFLogxK
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.conferencearticle???
AN - SCOPUS:85163079076
SN - 2640-3498
VL - 162
SP - 21828
EP - 21863
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
Y2 - 17 July 2022 through 23 July 2022
ER -