-
Notifications
You must be signed in to change notification settings - Fork 70
Description
Hi,
I am trying to use HyPhy to make pairwise Ka/Ks estimates for sequences from very large, genome-wide phylogenies, (including ancestral sequences), and I get weird behaviour from HyPhy; I wonder whether this is a bug. The sequences are very long, 400k codons, but also have very low divergence (~99% sequence similarity).
When I compare SeqA and SeqB from the phylogeny (with FitMG94), I get the global Ka/Ks for the sequence pair ~ 0.16. Setting the --rooted flag to yes is also fine, results in similar estimates:
| Branch | Length | dN/dS | Approximate dN/dS CI|
| SeqA | 0.002 | 0.161 | 0.140 - 0.182 |
| SeqB | 0.002 | 0.154 | 0.138 - 0.170 |
However, when I compare SeqA with an ancestral sequence (that is the last common ancestor of SeqA and SeqB) I get weird behaviour.
With the --rooted flag set to Yes, there is a weird asymmetry (which is reversible, the one having 0.0 Ka/Ks is always the second sequence in the alignment):
| Branch | Length | dN/dS |Approximate dN/dS CI|
| Anc | 0.002 | 0.315 | 0.298 - 0.333 |
| SeqA | 0.001 | 0.000 | 0.000 - 0.019 |
while with --type flag set to global Ka/Ks remains ~ 0.16 (that is also surprising). The ancestral sequence was reconstructed with IQtree2, with the MGK method.
I wonder whether this is an error (due to the very long sequences?), or normal behaviour.
Best wishes,
G.