Comprehensive analysis of co-occurring domain sets in yeast proteins

Inbar Cohen-Gihon, Ruth Nussinov, Roded Sharan

Research output: Contribution to journal › Article › peer-review


Background: Protein domains are fundamental evolutionary units of protein architecture, composing proteins in a modular manner. Combinations of two or more, possibly non-adjacent, domains are thought to play specific functional roles within proteins. Indeed, while the number of potential co-occurring domain sets (CDSs) is very large, only a few of these occur in nature. Here we study the principles governing domain content of proteins, using yeast as a model species.

Results: We design a novel representation of proteins and their constituent domains as a protein-domain network. An analysis of this network reveals 99 CDSs that occur in proteins more than expected by chance. The identified CDSs are shown to preferentially include ancient domains that are conserved from bacteria or archaea. Moreover, the protein sets spanned by these combinations were found to be highly functionally coherent, to match known protein complexes significantly, and to be enriched with protein-protein interactions. These observations serve to validate the biological significance of the identified CDSs.

Conclusion: Our work provides a comprehensive list of co-occurring domain sets in yeast, and sheds light on their function and evolution.
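To make the idea of a protein-domain network concrete, the following is a minimal sketch, not the authors' method. It assumes a toy mapping from proteins to domains (all names are hypothetical), treats that mapping as a bipartite network, and compares how often each pair of domains co-occurs with a naive expectation under independence. The abstract does not describe the statistical test used to identify the 99 CDSs, so the observed/expected ratio here is only an illustrative stand-in.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy data: protein -> set of domains it contains.
# These identifiers are illustrative and do not come from the paper.
protein_domains = {
    "P1": {"A", "B"},
    "P2": {"A", "B", "C"},
    "P3": {"A", "B"},
    "P4": {"C"},
    "P5": {"C", "D"},
    "P6": {"D"},
}


def domain_to_proteins(protein_domains):
    """Invert the mapping: domain -> set of proteins (the other side of the bipartite network)."""
    index = defaultdict(set)
    for protein, domains in protein_domains.items():
        for domain in domains:
            index[domain].add(protein)
    return index


def cooccurrence_table(protein_domains, set_size=2):
    """List domain sets with observed and naively expected co-occurrence counts.

    Expected count assumes domains are assigned to proteins independently,
    which is a simplifying assumption made for this sketch only.
    """
    index = domain_to_proteins(protein_domains)
    n = len(protein_domains)
    rows = []
    for combo in combinations(sorted(index), set_size):
        shared = set.intersection(*(index[d] for d in combo))
        if not shared:
            continue
        expected = n
        for d in combo:
            expected *= len(index[d]) / n
        rows.append((combo, len(shared), expected, sorted(shared)))
    # Rank by observed/expected ratio, most over-represented first.
    rows.sort(key=lambda r: r[1] / r[2], reverse=True)
    return rows


rows = cooccurrence_table(protein_domains)
for combo, observed, expected, proteins in rows:
    print(combo, observed, round(expected, 2), proteins)
```

In this toy data the pair (A, B) is found in three proteins against an expectation of 1.5, so it ranks first. A real analysis would replace the ratio with a proper significance test and multiple-testing correction.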

Original language: English
Article number: 161
Journal: BMC Genomics
State: Published - 11 Jun 2007

