GUIDANCE2: Accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters

Itamar Sela, Haim Ashkenazy, Kazutaka Katoh, Tal Pupko*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

632 Scopus citations

Abstract

Inference of multiple sequence alignments (MSAs) is a critical part of phylogenetic and comparative genomics studies. However, from the same set of sequences different MSAs are often inferred, depending on the methodologies used and the assumed parameters. Much effort has recently been devoted to improving the ability to identify unreliable alignment regions. Detecting such unreliable regions was previously shown to be important for downstream analyses relying on MSAs, such as the detection of positive selection. Here we developed GUIDANCE2, a new integrative methodology that accounts for: (i) uncertainty in the process of indel formation, (ii) uncertainty in the assumed guide tree and (iii) co-optimal solutions in the pairwise alignments, used as building blocks in progressive alignment algorithms. We compared GUIDANCE2 with seven methodologies to detect unreliable MSA regions using extensive simulations and empirical benchmarks. We show that GUIDANCE2 outperforms all previously developed methodologies. Furthermore, GUIDANCE2 also provides a set of alternative MSAs which can be useful for downstream analyses. The novel algorithm is implemented as a web-server, available at: http://guidance.tau.ac.il.

Original languageEnglish
Pages (from-to)W7-W14
JournalNucleic Acids Research
Volume43
Issue numberW1
DOIs
StatePublished - 2015

Funding

FundersFunder number
Edmond J. Safra Center for Ethics, Harvard University
Israel Science Foundation1092/13

    Fingerprint

    Dive into the research topics of 'GUIDANCE2: Accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters'. Together they form a unique fingerprint.

    Cite this