Integer Programming Based Algorithms for Overlapping Correlation Clustering

Barel I. Mashiach, Roded Sharan*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapterpeer-review

Abstract

Clustering is a fundamental problem in data science with diverse applications in biology. The problem has many combinatorial and statistical variants, yet few allow clusters to overlap which is common in the biological domain. Recently, Bonchi et al. defined a new variant of the clustering problem, termed overlapping correlation clustering, which calls for multi-label cluster assignments that correlate with an input similarity between elements as much as possible. This variant is NP-hard and was solved by Bonchi et al. using a local search heuristic. We revisit this heuristic and develop exact integer-programming based variants for it. We show that these variants perform well across several datasets and evaluation measures.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
PublisherSpringer Science and Business Media Deutschland GmbH
Pages115-127
Number of pages13
DOIs
StatePublished - 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14070 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Funding

FundersFunder number
Israel Science Foundation715/18

    Fingerprint

    Dive into the research topics of 'Integer Programming Based Algorithms for Overlapping Correlation Clustering'. Together they form a unique fingerprint.

    Cite this