Skip to content

KO-length normalization #13

@benoitdepins

Description

@benoitdepins

Hi all!

First, thanks for FunProfiler. Such a great tool.

I would like to aggregate multiple KOs (from the same pathway) into a single pathway-level value per metagenome (e.g., mean), then compare these pathway values across metagenomes. Intuitively, it seems I should normalize each KO’s abundance by its “length” first (longer KOs might have more chances to get matches than shorter ones).

My questions:

  • Are FunProfiler’s KO “abundances” length-normalized in any way (e.g., by KO sequence length or KO k-mer cardinality)?

  • If not, would you recommend normalizing by KO length before aggregating KOs within a pathway? (and if so, maybe normalizing another output/metric of FunProfiler instead of your calculated relative abundance? Like the intersect_bp value from prefetch_out?)

  • Or do you see any other caveats or limitations to this idea when using FunProfiler?

Thank you very much

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions