A discriminative model for identifying spatial cis-regulatory modules

Eran Segal, Roded Sharan*

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

27 Scopus citations

Abstract

Transcriptional regulation is mediated by the coordinated binding of transcription factors to the upstream regions of genes. In higher eukaryotes, the binding sites of cooperating transcription factors are organized into short sequence units, called cis-regulatory modules. In this paper, we propose a method for identifying modules of transcription factor binding sites in a set of co-regulated genes, using only the raw sequence data as input. Our method is based on a novel probabilistic model that describes the mechanism of cis-regulation, including the binding sites of cooperating transcription factors, the organization of these binding sites into short sequence modules, and the regulation of a gene by its modules. We show that our method is successful in discovering planted modules in simulated data and known modules in yeast. More importantly, we applied our method to a large collection of human gene sets and found 83 significant cis-regulatory modules, which included 36 known motifs and many novel ones. Thus, our results provide one of the first comprehensive compendiums of putative cis-regulatory modules in human.

Original language: English
Pages (from-to): 822-834
Number of pages: 13
Journal: Journal of Computational Biology
Volume: 12
Issue number: 6
State: Published - Jul 2005

Keywords

  • Cis-regulatory module
  • Probabilistic model
  • Transcriptional regulation
