TY - JOUR
T1 - Leveraging Genetic Variability across Populations for the Identification of Causal Variants
AU - Zaitlen, Noah
AU - Paşaniuc, Bogdan
AU - Gur, Tom
AU - Ziv, Elad
AU - Halperin, Eran
N1 - Funding Information:
N.Z., T.G., and E.H. were supported by the Israeli Science Foundation, grant no. 04514831. E.H. and B.P. were supported by the National Science Foundation, grant IIS-071325412. E.Z. was supported by the National Institutes of Health and the National Cancer Institute, grant R01CA120120, and by the Komen Foundation for Breast Cancer Research, grant KG080165. Part of this work was done when E.Z. was a fellow of the Edmond J. Safra Bioinformatics program at Tel-Aviv University. E.H. is a faculty fellow of the Edmond J. Safra Bioinformatics program at Tel-Aviv University. We kindly thank the reviewers for their insightful comments.
PY - 2010
Y1 - 2010
N2 - Genome-wide association studies have been performed extensively in the last few years, resulting in many new discoveries of genomic regions that are associated with complex traits. It is often the case that a SNP found to be associated with the condition is not the causal SNP, but a proxy to it as a result of linkage disequilibrium. For the identification of the actual causal SNP, fine-mapping follow-up is performed, either with the use of dense genotyping or by sequencing of the region. In either case, if the causal SNP is in high linkage disequilibrium with other SNPs, the fine-mapping procedure will require a very large sample size for the identification of the causal SNP. Here, we show that by leveraging genetic variability across populations, we significantly increase the localization success rate (LSR) for a causal SNP in a follow-up study that involves multiple populations as compared to a study that involves only one population. Thus, the average power for detection of the causal variant will be higher in a joint analysis than that in studies in which only one population is analyzed at a time. On the basis of this observation, we developed a framework to efficiently search for a follow-up study design: our framework searches for the best combination of populations from a pool of available populations to maximize the LSR for detection of a causal variant. This framework and its accompanying software can be used to considerably enhance the power of fine-mapping studies.
AB - Genome-wide association studies have been performed extensively in the last few years, resulting in many new discoveries of genomic regions that are associated with complex traits. It is often the case that a SNP found to be associated with the condition is not the causal SNP, but a proxy to it as a result of linkage disequilibrium. For the identification of the actual causal SNP, fine-mapping follow-up is performed, either with the use of dense genotyping or by sequencing of the region. In either case, if the causal SNP is in high linkage disequilibrium with other SNPs, the fine-mapping procedure will require a very large sample size for the identification of the causal SNP. Here, we show that by leveraging genetic variability across populations, we significantly increase the localization success rate (LSR) for a causal SNP in a follow-up study that involves multiple populations as compared to a study that involves only one population. Thus, the average power for detection of the causal variant will be higher in a joint analysis than that in studies in which only one population is analyzed at a time. On the basis of this observation, we developed a framework to efficiently search for a follow-up study design: our framework searches for the best combination of populations from a pool of available populations to maximize the LSR for detection of a causal variant. This framework and its accompanying software can be used to considerably enhance the power of fine-mapping studies.
UR - http://www.scopus.com/inward/record.url?scp=73149100820&partnerID=8YFLogxK
U2 - 10.1016/j.ajhg.2009.11.016
DO - 10.1016/j.ajhg.2009.11.016
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:73149100820
SN - 0002-9297
VL - 86
SP - 23
EP - 33
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 1
ER -