Indel reliability in indel-based phylogenetic inference

Haim Ashkenazy, Ofir Cohen, Tal Pupko, Dorothée Huchon

Research output: Contribution to journalArticlepeer-review

Abstract

It is often assumed that it is unlikely that the same insertion or deletion (indel) event occurred at the same position in two independent evolutionary lineages, and thus, indel-based inference of phylogeny should be less subject to homoplasy compared with standard inference which is based on substitution events. Indeed, indels were successfully used to solve debated evolutionary relationships among various taxonomical groups. However, indels are never directly observed but rather inferred from the alignment and thus indel-based inference may be sensitive to alignment errors. It is hypothesized that phylogenetic reconstruction would be more accurate if it relied only on a subset of reliable indels instead of the entire indel data. Here, we developed a method to quantify the reliability of indel characters by measuring how often they appear in a set of alternative multiple sequence alignments. Our approach is based on the assumption that indels that are consistently present in most alternative alignments are more reliable compared with indels that appear only in a small subset of these alignments. Using simulated and empirical data, we studied the impact of filtering and weighting indels by their reliability scores on the accuracy of indel-based phylogenetic reconstruction. The new method is available as a web-server at http://guidance.tau.ac.il/RELINDEL.

Original languageEnglish
Pages (from-to)3199-3209
Number of pages11
JournalGenome Biology and Evolution
Volume6
Issue number12
DOIs
StatePublished - Dec 2014

Keywords

  • Alignment reliability
  • Indel analysis
  • Multiple sequence alignment
  • Phylogeny

Fingerprint

Dive into the research topics of 'Indel reliability in indel-based phylogenetic inference'. Together they form a unique fingerprint.

Cite this