TY - JOUR
T1 - Inference of locus-specific ancestry in closely related populations
AU - Paşaniuc, Bogdan
AU - Sankararaman, Sriram
AU - Kimmel, Gad
AU - Halperin, Eran
N1 - Funding Information:
Funding: NSF (grant 0713254 to E.H. and B.P.); NSF (grant 0513599 to G.K.); Berkeley Fellowship (to S.S.); Wellcome Trust (under award 076113); E.H. is a faculty fellow of the Edmond J. Safra Bioinformatics program at Tel Aviv University.
PY - 2009
Y1 - 2009
N2 - A characterization of the genetic variation of recently admixed populations may reveal historical population events, and is useful for the detection of single nucleotide polymorphisms (SNPs) associated with diseases through association studies and admixture mapping. Inference of locus-specific ancestry is key to our understanding of the genetic variation of such populations. While a number of methods for the inference of locus-specific ancestry are accurate when the ancestral populations are quite distant (e.g. African-Americans), current methods incur a large error rate when inferring the locus-specific ancestry in admixed populations where the ancestral populations are closely related (e.g. Americans of European descent). Results: In this work, we extend previous methods for the inference of locus-specific ancestry by the incorporation of a refined model of recombination events. We present an efficient dynamic programming algorithm to infer the locus-specific ancestries in this model, resulting in a method that attains improved accuracies; the improvement is most significant when the ancestral populations are closely related. An evaluation on a wide range of scenarios, including admixtures of the 52 population groups from the Human Genome Diversity Project demonstrates that locus-specific ancestry can indeed be accurately inferred in these admixtures using our method. Finally, we demonstrate that imputation methods can be improved by the incorporation of locus-specific ancestry, when applied to admixed populations.
AB - A characterization of the genetic variation of recently admixed populations may reveal historical population events, and is useful for the detection of single nucleotide polymorphisms (SNPs) associated with diseases through association studies and admixture mapping. Inference of locus-specific ancestry is key to our understanding of the genetic variation of such populations. While a number of methods for the inference of locus-specific ancestry are accurate when the ancestral populations are quite distant (e.g. African-Americans), current methods incur a large error rate when inferring the locus-specific ancestry in admixed populations where the ancestral populations are closely related (e.g. Americans of European descent). Results: In this work, we extend previous methods for the inference of locus-specific ancestry by the incorporation of a refined model of recombination events. We present an efficient dynamic programming algorithm to infer the locus-specific ancestries in this model, resulting in a method that attains improved accuracies; the improvement is most significant when the ancestral populations are closely related. An evaluation on a wide range of scenarios, including admixtures of the 52 population groups from the Human Genome Diversity Project demonstrates that locus-specific ancestry can indeed be accurately inferred in these admixtures using our method. Finally, we demonstrate that imputation methods can be improved by the incorporation of locus-specific ancestry, when applied to admixed populations.
UR - http://www.scopus.com/inward/record.url?scp=66349088624&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btp197
DO - 10.1093/bioinformatics/btp197
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:66349088624
SN - 1367-4803
VL - 25
SP - i213-i221
JO - Bioinformatics
JF - Bioinformatics
IS - 12
ER -