Skip to content

Inconsistencies in Ka/Ks estimation ? #1960

@ga01

Description

@ga01

Hi,
I am trying to use HyPhy to make pairwise Ka/Ks estimates for sequences from very large, genome-wide phylogenies, (including ancestral sequences), and I get weird behaviour from HyPhy; I wonder whether this is a bug. The sequences are very long, 400k codons, but also have very low divergence (~99% sequence similarity).

When I compare SeqA and SeqB from the phylogeny (with FitMG94), I get the global Ka/Ks for the sequence pair ~ 0.16. Setting the --rooted flag to yes is also fine, results in similar estimates:
| Branch | Length | dN/dS | Approximate dN/dS CI|
| SeqA | 0.002 | 0.161 | 0.140 - 0.182 |
| SeqB | 0.002 | 0.154 | 0.138 - 0.170 |

However, when I compare SeqA with an ancestral sequence (that is the last common ancestor of SeqA and SeqB) I get weird behaviour.
With the --rooted flag set to Yes, there is a weird asymmetry (which is reversible, the one having 0.0 Ka/Ks is always the second sequence in the alignment):
| Branch | Length | dN/dS |Approximate dN/dS CI|
| Anc | 0.002 | 0.315 | 0.298 - 0.333 |
| SeqA | 0.001 | 0.000 | 0.000 - 0.019 |

while with --type flag set to global Ka/Ks remains ~ 0.16 (that is also surprising). The ancestral sequence was reconstructed with IQtree2, with the MGK method.
I wonder whether this is an error (due to the very long sequences?), or normal behaviour.

Best wishes,
G.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions