TY - JOUR
T1 - CLICK and EXPANDER
T2 - A system for clustering and visualizing gene expression data
AU - Sharan, Roded
AU - Maron-Katz, Adi
AU - Shamir, Ron
N1 - Funding Information:
We thank Naama Arbili for her great help in programming CLICK. We thank Rani Elkon and Amos Tanay for many helpful discussions. R.S. was supported by a Fulbright Grant. This study was supported by a research grant from the Ministry of Science and Technology, Israel.
PY - 2003/9/22
Y1 - 2003/9/22
N2 - Motivation: Microarrays have become a central tool in biological research. Their applications range from functional annotation to tissue classification and genetic network inference. A key step in the analysis of gene expression data is the identification of groups of genes that manifest similar expression patterns. This translates to the algorithmic problem of clustering genes based on their expression patterns. Results: We present a novel clustering algorithm, called CLICK, and its applications to gene expression analysis. The algorithm utilizes graph-theoretic and statistical techniques to identify tight groups (kernels) of highly similar elements, which are likely to belong to the same true cluster. Several heuristic procedures are then used to expand the kernels into the full clusters. We report on the application of CLICK to a variety of gene expression data sets. In all those applications it outperformed extant algorithms according to several common figures of merit. We also point out that CLICK can be successfully used for the identification of common regulatory motifs in the upstream regions of co-regulated genes. Furthermore, we demonstrate how CLICK can be used to accurately classify tissue samples into disease types, based on their expression profiles. Finally, we present a new java-based graphical tool, called EXPANDER, for gene expression analysis and visualization, which incorporates CLICK and several other popular clustering algorithms.
AB - Motivation: Microarrays have become a central tool in biological research. Their applications range from functional annotation to tissue classification and genetic network inference. A key step in the analysis of gene expression data is the identification of groups of genes that manifest similar expression patterns. This translates to the algorithmic problem of clustering genes based on their expression patterns. Results: We present a novel clustering algorithm, called CLICK, and its applications to gene expression analysis. The algorithm utilizes graph-theoretic and statistical techniques to identify tight groups (kernels) of highly similar elements, which are likely to belong to the same true cluster. Several heuristic procedures are then used to expand the kernels into the full clusters. We report on the application of CLICK to a variety of gene expression data sets. In all those applications it outperformed extant algorithms according to several common figures of merit. We also point out that CLICK can be successfully used for the identification of common regulatory motifs in the upstream regions of co-regulated genes. Furthermore, we demonstrate how CLICK can be used to accurately classify tissue samples into disease types, based on their expression profiles. Finally, we present a new java-based graphical tool, called EXPANDER, for gene expression analysis and visualization, which incorporates CLICK and several other popular clustering algorithms.
UR - http://www.scopus.com/inward/record.url?scp=0141506116&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btg232
DO - 10.1093/bioinformatics/btg232
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:0141506116
VL - 19
SP - 1787
EP - 1799
JO - Bioinformatics
JF - Bioinformatics
SN - 1367-4803
IS - 14
ER -