TY - JOUR
T1 - Experimental design schemes for learning Boolean network models
AU - Atias, Nir
AU - Gershenzon, Michal
AU - Labazin, Katia
AU - Sharan, Roded
N1 - Funding Information:
Funding: Edmond J. Safra Center for Bioinformatics at Tel Aviv University (to N.A.) and the I-CORE Program of the Planning and Budgeting Committee and The Israel Science Foundation (757/12 to R.S.).
PY - 2014/9/1
Y1 - 2014/9/1
N2 - Motivation: A holy grail of biological research is a working model of the cell. Current modeling frameworks, especially in the protein-protein interaction domain, are mostly topological in nature, calling for stronger and more expressive network models. One promising alternative is logic-based or Boolean network modeling, which was successfully applied to model signaling regulatory circuits in human. Learning such models requires observing the system under a sufficient number of different conditions. To date, the amount of measured data is the main bottleneck in learning informative Boolean models, underscoring the need for efficient experimental design strategies. Results: We developed novel design approaches that greedily select an experiment to be performed so as to maximize the difference or the entropy in the results it induces with respect to current best-fit models. Unique to our maximum difference approach is the ability to account for all (possibly exponential number of) Boolean models displaying high fit to the available data. We applied both approaches to simulated and real data from the EFGR and IL1 signaling systems in human. We demonstrate the utility of the developed strategies in substantially improving on a random selection approach. Our design schemes highlight the redundancy in these datasets, leading up to 11-fold savings in the number of experiments to be performed.
AB - Motivation: A holy grail of biological research is a working model of the cell. Current modeling frameworks, especially in the protein-protein interaction domain, are mostly topological in nature, calling for stronger and more expressive network models. One promising alternative is logic-based or Boolean network modeling, which was successfully applied to model signaling regulatory circuits in human. Learning such models requires observing the system under a sufficient number of different conditions. To date, the amount of measured data is the main bottleneck in learning informative Boolean models, underscoring the need for efficient experimental design strategies. Results: We developed novel design approaches that greedily select an experiment to be performed so as to maximize the difference or the entropy in the results it induces with respect to current best-fit models. Unique to our maximum difference approach is the ability to account for all (possibly exponential number of) Boolean models displaying high fit to the available data. We applied both approaches to simulated and real data from the EFGR and IL1 signaling systems in human. We demonstrate the utility of the developed strategies in substantially improving on a random selection approach. Our design schemes highlight the redundancy in these datasets, leading up to 11-fold savings in the number of experiments to be performed.
UR - http://www.scopus.com/inward/record.url?scp=84907024690&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btu451
DO - 10.1093/bioinformatics/btu451
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 25161232
AN - SCOPUS:84907024690
SN - 1367-4803
VL - 30
SP - i445-i452
JO - Bioinformatics
JF - Bioinformatics
IS - 17
ER -