TY - JOUR
T1 - Exploiting population samples to enhance genome-wide association studies of disease
AU - Kaufman, Shachar
AU - Rosset, Saharon
PY - 2014/5
Y1 - 2014/5
N2 - It is widely acknowledged that genome-wide association studies (GWAS) of complex human disease fail to explain a large portion of heritability, primarily due to lack of statistical power-a problem that is exacerbated when seeking detection of interactions of multiple genomic loci. An untapped source of information that is already widely available, and that is expected to grow in coming years, is population samples. Such samples contain genetic marker data for additional individuals, but not their relevant phenotypes. In this article we develop a highly efficient testing framework based on a constrained maximum-likelihood estimate in a case-control- population setting. We leverage the available population data and optional modeling assumptions, such as Hardy-Weinberg equilibrium (HWE) in the population and linkage equilibrium (LE) between distal loci, to substantially improve power of association and interaction tests. We demonstrate, via simulation and application to actual GWAS data sets, that our approach is substantially more powerful and robust than standard testing approaches that ignore or make naive use of the population sample. We report several novel and credible pairwise interactions, in bipolar disorder, coronary artery disease, Crohn's disease, and rheumatoid arthritis.
AB - It is widely acknowledged that genome-wide association studies (GWAS) of complex human disease fail to explain a large portion of heritability, primarily due to lack of statistical power-a problem that is exacerbated when seeking detection of interactions of multiple genomic loci. An untapped source of information that is already widely available, and that is expected to grow in coming years, is population samples. Such samples contain genetic marker data for additional individuals, but not their relevant phenotypes. In this article we develop a highly efficient testing framework based on a constrained maximum-likelihood estimate in a case-control- population setting. We leverage the available population data and optional modeling assumptions, such as Hardy-Weinberg equilibrium (HWE) in the population and linkage equilibrium (LE) between distal loci, to substantially improve power of association and interaction tests. We demonstrate, via simulation and application to actual GWAS data sets, that our approach is substantially more powerful and robust than standard testing approaches that ignore or make naive use of the population sample. We report several novel and credible pairwise interactions, in bipolar disorder, coronary artery disease, Crohn's disease, and rheumatoid arthritis.
UR - http://www.scopus.com/inward/record.url?scp=84901328050&partnerID=8YFLogxK
U2 - 10.1534/genetics.114.162511
DO - 10.1534/genetics.114.162511
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:84901328050
SN - 0016-6731
VL - 197
SP - 337
EP - 349
JO - Genetics
JF - Genetics
IS - 1
ER -