Algorithm for clustering cDNAs for gene expression analysis

Erez Hartuv*, Armin Schmitt, Jorg Lange, Sebastian Meier-Ewert, Hans Lehrach, Ron Shamir

*Corresponding author for this work

Research output: Contribution to conferencePaperpeer-review

68 Scopus citations

Abstract

We have developed a novel algorithm for cluster analysis that is based on graph theoretic techniques. A similarity graph is defined and clusters in that graph correspond to highly connected subgraphs. A polynomial algorithm to compute them efficiently is presented. Our algorithm produces a clustering with some probably good properties. The application that motivated this study was gene expression analysis, where a collection of cDNAs must be clustered based on their oligonucleotide fingerprints. The algorithm has been tested intensively on simulated libraries and was shown to outperform extant methods. It demonstrated robustness to high noise levels. In a blind test on real cDNA fingerprint data the algorithm obtained very good results. Utilizing the results of the algorithm would have saved over 70% of the cDNA sequencing cost on that data set.

Original languageEnglish
Pages188-197
Number of pages10
DOIs
StatePublished - 1999
EventProceedings of the 1999 3rd Annual International Conference on Computational Molecular Biology, RECOMB '99 - Lyon
Duration: 11 Apr 199914 Apr 1999

Conference

ConferenceProceedings of the 1999 3rd Annual International Conference on Computational Molecular Biology, RECOMB '99
CityLyon
Period11/04/9914/04/99

Fingerprint

Dive into the research topics of 'Algorithm for clustering cDNAs for gene expression analysis'. Together they form a unique fingerprint.

Cite this