TY - JOUR
T1 - Do tree split probabilities determine the branch lengths?
AU - Chor, Benny
AU - Steel, Mike
N1 - Publisher Copyright:
© 2015 Elsevier Ltd.
PY - 2015/6/7
Y1 - 2015/6/7
N2 - The evolution of aligned DNA sequence sites is generally modeled by a Markov process operating along the edges of a phylogenetic tree. It is well known that the probability distribution on the site patterns at the tips of the tree determines the tree topology, and its branch lengths. However, the number of patterns is typically much larger than the number of edges, suggesting considerable redundancy in the branch length estimation. In this paper we ask whether the probabilities of just the 'edge-specific' patterns (the ones that correspond to a change of state on a single edge) suffice to recover the branch lengths of the tree, under a symmetric 2-state Markov process. We first show that this holds provided the branch lengths are sufficiently short, by applying the inverse function theorem. We then consider whether this restriction to short branch lengths is necessary. We show that for trees with up to four leaves it can be lifted. This leaves open the interesting question of whether this holds in general. Our results also extend to certain Markov processes on more than 2-states, such as the Jukes-Cantor model.
AB - The evolution of aligned DNA sequence sites is generally modeled by a Markov process operating along the edges of a phylogenetic tree. It is well known that the probability distribution on the site patterns at the tips of the tree determines the tree topology, and its branch lengths. However, the number of patterns is typically much larger than the number of edges, suggesting considerable redundancy in the branch length estimation. In this paper we ask whether the probabilities of just the 'edge-specific' patterns (the ones that correspond to a change of state on a single edge) suffice to recover the branch lengths of the tree, under a symmetric 2-state Markov process. We first show that this holds provided the branch lengths are sufficiently short, by applying the inverse function theorem. We then consider whether this restriction to short branch lengths is necessary. We show that for trees with up to four leaves it can be lifted. This leaves open the interesting question of whether this holds in general. Our results also extend to certain Markov processes on more than 2-states, such as the Jukes-Cantor model.
KW - Evolutionary model
KW - Hadamard transform
KW - Inverse function theorem
KW - Markov process
KW - Phylogenetic tree reconstruction
KW - Systems of polynomial equations
UR - http://www.scopus.com/inward/record.url?scp=84927614610&partnerID=8YFLogxK
U2 - 10.1016/j.jtbi.2015.03.024
DO - 10.1016/j.jtbi.2015.03.024
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 25843219
AN - SCOPUS:84927614610
SN - 0022-5193
VL - 374
SP - 54
EP - 59
JO - Journal of Theoretical Biology
JF - Journal of Theoretical Biology
ER -