KO-length normalization

Hi all! 

First, thanks for FunProfiler. Such a great tool.

I would like to aggregate multiple KOs (from the same pathway) into a single pathway-level value per metagenome (e.g., mean), then compare these pathway values across metagenomes. Intuitively, it seems I should normalize each KO’s abundance by its “length” first (longer KOs might have more chances to get matches than shorter ones).

My questions:

- Are FunProfiler’s KO “abundances” length-normalized in any way (e.g., by KO sequence length or KO k-mer cardinality)?

- If not, would you recommend normalizing by KO length before aggregating KOs within a pathway? (and if so, maybe normalizing another output/metric of FunProfiler instead of your calculated relative abundance? Like the intersect_bp value from prefetch_out?)

- Or do you see any other caveats or limitations to this idea when using FunProfiler?

Thank you very much

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KO-length normalization #13

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

KO-length normalization #13

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions