-
Notifications
You must be signed in to change notification settings - Fork 4
Description
Hi all!
First, thanks for FunProfiler. Such a great tool.
I would like to aggregate multiple KOs (from the same pathway) into a single pathway-level value per metagenome (e.g., mean), then compare these pathway values across metagenomes. Intuitively, it seems I should normalize each KO’s abundance by its “length” first (longer KOs might have more chances to get matches than shorter ones).
My questions:
-
Are FunProfiler’s KO “abundances” length-normalized in any way (e.g., by KO sequence length or KO k-mer cardinality)?
-
If not, would you recommend normalizing by KO length before aggregating KOs within a pathway? (and if so, maybe normalizing another output/metric of FunProfiler instead of your calculated relative abundance? Like the intersect_bp value from prefetch_out?)
-
Or do you see any other caveats or limitations to this idea when using FunProfiler?
Thank you very much