Large scale recovery of haplotypes from genotype data using imperfect phylogeny

Eran Halperin*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Critical to the understanding of the genetic basis for complex diseases is the modeling of human variation. Most of this variation can be characterized by single nucleotide polymorphisms (SNPs) which are mutations at a single nucleotide position. To characterize an individual’s variation, we must determine an individual’s haplotype or which nucleotide base occurs at each position of these common SNPs for each chromosome. In this paper, we present results for a highly accurate method for haplotype resolution from genotype data. Our method leverages a new insight into the underlying structure of haplotypes which shows that SNPs are organized in highly correlated “blocks”. The majority of individuals have one of about four common haplotypes in each block. Our method partitions the SNPs into blocks and for each block, we predict the common haplotypes and each individual’s haplotype. We evaluate our method over biological data. Our method predicts the common haplotypes perfectly and has a very low error rate (0. 47%) when taking into account the predictions for the uncommon haplotypes.

Original languageEnglish
Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
EditorsSorin Istrail, Michael Waterman, Andrew Clark
PublisherSpringer Verlag
Pages137
Number of pages1
ISBN (Print)9783540212492
DOIs
StatePublished - 2004
Externally publishedYes
Event1st Computational Methods for SNPs and Haplotype Inference Workshop, 2002 - Piscataway, United States
Duration: 21 Nov 200222 Nov 2002

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume2983
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference1st Computational Methods for SNPs and Haplotype Inference Workshop, 2002
Country/TerritoryUnited States
CityPiscataway
Period21/11/0222/11/02

Fingerprint

Dive into the research topics of 'Large scale recovery of haplotypes from genotype data using imperfect phylogeny'. Together they form a unique fingerprint.

Cite this