On the inference of ancestries in admixed populations

Sriram Sankararaman, Gad Kimmel, Eran Halperin, Michael I. Jordan

Research output: Contribution to journalArticlepeer-review

37 Scopus citations

Abstract

Inference of ancestral information in recently admixed populations, in which every individual is composed of a mixed ancestry (e.g., African Americans in the United States), is a challenging problem. Several previous model-based approaches to admixture have been based on hidden Markov models (HMMs) and Markov hidden Markov models (MHMMs). We present an augmented form of these models that can be used to predict historical recombination events and can model background linkage disequilibrium (LD) more accurately. We also study some of the computational issues that arise in using such Markovian models on realistic data sets. In particular, we present an effective initialization procedure that, when combined with expectation-maximization (EM) algorithms for parameter estimation, yields high accuracy at significantly decreased computational cost relative to the Markov chain Monte Carlo (MCMC) algorithms that have generally been used in earlier studies. We present experiments exploring these modeling and algorithmic issues in two scenarios - the inference of locus-specific ancestries in a population that is assumed to originate from two unknown ancestral populations, and the inference of allele frequencies in one ancestral population given those in another.

Original languageEnglish
Pages (from-to)668-675
Number of pages8
JournalGenome Research
Volume18
Issue number4
DOIs
StatePublished - Apr 2008
Externally publishedYes

Funding

FundersFunder number
National Human Genome Research InstituteR33HG003070

    Fingerprint

    Dive into the research topics of 'On the inference of ancestries in admixed populations'. Together they form a unique fingerprint.

    Cite this