Feature request #62

magnusarntzen · 2023-10-11T12:24:00Z

Since you asked for feedback...

What about implementing calculations of module completion factors (mcf)? These are values between 0 and 1 indicating to what extent a bin has the genes required to complete a given reaction, e.g., 'denitrification' or 'methanogenesis'.

This can be done with the MetQy package in R (I have code if you want) and it would complement your output nicely.
I attach an example output for some of my samples with 150 bins.
MetQy_mcf.pdf

cmkobel · 2023-10-11T14:28:56Z

Thanks! Great idea! MCFs are definitely easier to interpret than p-values for GSEA. That R package looks neat, but the KO calls are already made in the kegg_diamond rule so we just need the table and algorithm that links the KOs to pathways and computes the MCF, then we're there! I'll look into a way of integrating that.
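To make the "table and algorithm" idea concrete, here is a minimal sketch of a completion fraction computed from per-bin KO sets. It is a simplification under stated assumptions, not MetQy's or gapseq's actual algorithm: each module is modelled as a list of steps, each step is a set of alternative KOs, and the score is the fraction of steps with at least one KO present. The module names and their KO contents are hypothetical placeholders, not real KEGG module definitions.

```python
from typing import Dict, List, Set

# Hypothetical module definitions (NOT real KEGG modules).
# Each module is a list of steps; each step is a set of alternative KOs,
# any one of which is taken to satisfy that step.
EXAMPLE_MODULES: Dict[str, List[Set[str]]] = {
    "example_module_A": [{"K00001"}, {"K00032", "K22001"}, {"K24233"}],
    "example_module_B": [{"K22001"}, {"K32231"}, {"K99999"}, {"K00001"}],
}


def completion_fraction(bin_kos: Set[str], steps: List[Set[str]]) -> float:
    """Fraction of module steps for which the bin has at least one KO."""
    if not steps:
        return 0.0
    satisfied = sum(1 for alternatives in steps if alternatives & bin_kos)
    return satisfied / len(steps)


def mcf_table(
    bins: Dict[str, Set[str]],
    modules: Dict[str, List[Set[str]]],
) -> Dict[str, Dict[str, float]]:
    """Compute a bin-by-module table of completion fractions."""
    return {
        bin_id: {
            module_id: completion_fraction(kos, steps)
            for module_id, steps in modules.items()
        }
        for bin_id, kos in bins.items()
    }


if __name__ == "__main__":
    example_bins = {
        "Bin1": {"K00001", "K00032", "K24233"},
        "Bin2": {"K22001", "K32231"},
    }
    for bin_id, row in mcf_table(example_bins, EXAMPLE_MODULES).items():
        print(bin_id, row)
```

Real KEGG module definitions also nest alternatives and complexes, so a production version would need a proper parser for those expressions rather than the flat step list assumed here.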

magnusarntzen · 2023-10-12T06:16:29Z

Hey,

The R package MetQy does not do the KO calling, so it is good that you have another program that does that for you. I use KoFamScan in my pipelines, but I am sure kegg_diamond does the trick too. MetQy takes a dataframe with semicolon-separated KOs per bin:

Bin1 K00001;K00032;K24233
Bin2 K22001;K32231
etc.

NB: these are lists of gene K-numbers, not pathway KO-numbers. It takes about 10-15 minutes for 150 bins on my laptop, but I suppose it will be fast on the Threadripper.
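As an illustration of the input layout described above (one bin per row, followed by its semicolon-separated K-numbers), a small parser could look like the sketch below. The function name `parse_bin_kos` and the whitespace-delimited layout are assumptions made for illustration; the actual dataframe MetQy expects may use different column names or delimiters.

```python
from typing import Dict, Set


def parse_bin_kos(text: str) -> Dict[str, Set[str]]:
    """Parse lines of the form 'BinID KO1;KO2;...' into {bin: set of KOs}.

    Assumes the bin ID and the KO list are separated by whitespace.
    Blank lines are skipped; a bin without KOs gets an empty set.
    """
    bins: Dict[str, Set[str]] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        parts = line.split(None, 1)  # split on the first run of whitespace
        bin_id = parts[0]
        ko_field = parts[1] if len(parts) > 1 else ""
        kos = {ko.strip() for ko in ko_field.split(";") if ko.strip()}
        # Merge in case the same bin appears on more than one line.
        bins.setdefault(bin_id, set()).update(kos)
    return bins


if __name__ == "__main__":
    example = "Bin1 K00001;K00032;K24233\nBin2 K22001;K32231\n"
    print(parse_bin_kos(example))
```

Storing the K-numbers as sets makes the later "does this bin have the required gene" lookups cheap and ignores duplicate calls for the same KO.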

-M

cmkobel · 2024-05-02T12:59:16Z

This will be solved by adding gapseq, which calculates pathway completion fractions. It is well maintained and very powerful. Currently waiting for r-chnosz to be published on conda-forge so that gapseq can be published on bioconda; then we can finally add gapseq to asscom2.

cmkobel added the enhancement New feature or request label Oct 11, 2023

cmkobel added this to the Reach 100 stars on Github milestone Jul 19, 2024

cmkobel self-assigned this Jul 23, 2024

cmkobel added feature request and removed enhancement New feature or request labels Jul 24, 2024
