TY - JOUR
T1 - Leveraging the HapMap correlation structure in association studies
AU - Zaitlen, Noah
AU - Kang, Hyun Min
AU - Eskin, Eleazar
AU - Halperin, Eran
N1 - Funding Information:
N.Z. is supported by the Microsoft Graduate Research Fellowship. N.Z. and E.E. are partially supported by National Science Foundation (NSF) grant 0513612 and National Institutes of Health grant 1K25HL080079. E.H. is supported by NSF grant IIS-0513599. H.M.K. is supported by the Samsung Scholarship. Part of this investigation was performed using the computing facility made possible by grants from the National Center for Research Resources, National Institutes of Health: the Research Facilities Improvement Program grant C06 RR017588, awarded to the Whitaker Biomedical Engineering Institute, and the Biomedical Technology Resource Centers Program grant P41 RR08605, awarded to the National Biomedical Computation Resource, University of California–San Diego. Additional computational resources were provided by the California Institute of Telecommunications and Information Technology (Calit2).
PY - 2007/4
Y1 - 2007/4
N2 - Recent high-throughput genotyping technologies, such as the Affymetrix 500k array and the Illumina HumanHap 550 beadchip, have driven down the costs of association studies and have enabled the measurement of single-nucleotide polymorphism (SNP) allele frequency differences between case and control populations on a genomewide scale. A key aspect in the efficiency of association studies is the notion of "indirect association," where only a subset of SNPs are collected to serve as proxies for the uncollected SNPs, taking advantage of the correlation structure between SNPs. Recently, a new class of methods for indirect association, multimarker methods, has been proposed. Although the multimarker methods are a considerable advancement, current methods do not fully take advantage of the correlation structure between SNPs and their multimarker proxies. In this article, we propose a novel multimarker indirect-association method, WHAP, that is based on a weighted sum of the haplotype frequency differences. In contrast to traditional indirect-association methods, we show analytically that there is a considerable gain in power achieved by our method compared with both single-marker and multimarker tests, as well as traditional haplotype-based tests. Our results are supported by empirical evaluation across the HapMap reference panel data sets, and a software implementation for the Affymetrix 500k and Illumina HumanHap 550 chips is available for download.
AB - Recent high-throughput genotyping technologies, such as the Affymetrix 500k array and the Illumina HumanHap 550 beadchip, have driven down the costs of association studies and have enabled the measurement of single-nucleotide polymorphism (SNP) allele frequency differences between case and control populations on a genomewide scale. A key aspect in the efficiency of association studies is the notion of "indirect association," where only a subset of SNPs are collected to serve as proxies for the uncollected SNPs, taking advantage of the correlation structure between SNPs. Recently, a new class of methods for indirect association, multimarker methods, has been proposed. Although the multimarker methods are a considerable advancement, current methods do not fully take advantage of the correlation structure between SNPs and their multimarker proxies. In this article, we propose a novel multimarker indirect-association method, WHAP, that is based on a weighted sum of the haplotype frequency differences. In contrast to traditional indirect-association methods, we show analytically that there is a considerable gain in power achieved by our method compared with both single-marker and multimarker tests, as well as traditional haplotype-based tests. Our results are supported by empirical evaluation across the HapMap reference panel data sets, and a software implementation for the Affymetrix 500k and Illumina HumanHap 550 chips is available for download.
UR - http://www.scopus.com/inward/record.url?scp=34147154784&partnerID=8YFLogxK
U2 - 10.1086/513109
DO - 10.1086/513109
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 17357074
AN - SCOPUS:34147154784
SN - 0002-9297
VL - 80
SP - 683
EP - 691
JO - American Journal of Human Genetics
JF - American Journal of Human Genetics
IS - 4
ER -