TY - CHAP
T1 - Integer Programming Based Algorithms for Overlapping Correlation Clustering
AU - Mashiach, Barel I.
AU - Sharan, Roded
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.
PY - 2024
Y1 - 2024
N2 - Clustering is a fundamental problem in data science with diverse applications in biology. The problem has many combinatorial and statistical variants, yet few allow clusters to overlap which is common in the biological domain. Recently, Bonchi et al. defined a new variant of the clustering problem, termed overlapping correlation clustering, which calls for multi-label cluster assignments that correlate with an input similarity between elements as much as possible. This variant is NP-hard and was solved by Bonchi et al. using a local search heuristic. We revisit this heuristic and develop exact integer-programming based variants for it. We show that these variants perform well across several datasets and evaluation measures.
AB - Clustering is a fundamental problem in data science with diverse applications in biology. The problem has many combinatorial and statistical variants, yet few allow clusters to overlap which is common in the biological domain. Recently, Bonchi et al. defined a new variant of the clustering problem, termed overlapping correlation clustering, which calls for multi-label cluster assignments that correlate with an input similarity between elements as much as possible. This variant is NP-hard and was solved by Bonchi et al. using a local search heuristic. We revisit this heuristic and develop exact integer-programming based variants for it. We show that these variants perform well across several datasets and evaluation measures.
UR - http://www.scopus.com/inward/record.url?scp=85188785158&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-55248-9_6
DO - 10.1007/978-3-031-55248-9_6
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.chapter???
AN - SCOPUS:85188785158
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 115
EP - 127
BT - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PB - Springer Science and Business Media Deutschland GmbH
ER -