Pipeline requires 72GB of RAM, even to test. #408

mpiersonsmela · 2024-07-26T00:18:38Z

Description of the bug

On my university's cluster, users are penalized (with priority reduction) for requesting more RAM than they actually use. So the fact that the pipeline requires at least 72GB of RAM to run is an issue for me, given that I'm just trying to test it with the example samplesheet.csv from https://nf-co.re/methylseq/2.6.0/

This is the relevant portion of the output. Does bismark genome preparation really need so much RAM?

`ERROR ~ Error executing process > 'NFCORE_METHYLSEQ:METHYLSEQ:PREPARE_GENOME:BISMARK_GENOMEPREPARATION (BismarkIndex/grch38_core+bs_controls.fa)'

Caused by:
Process requirement exceeds available memory -- req: 72 GB; avail: 32 GB

Command executed:

bismark_genome_preparation
--bowtie2
BismarkIndex`

Command used and terminal output

nextflow run nf-core/methylseq \
--input test_samplesheet.csv \
--outdir Output \
--fasta grch38_core+bs_controls.fa \
-w /n/scratch/users/m/NF_MiSeq \
-ansi-log false

Relevant files

No response

System information

No response

imdanique · 2024-08-07T11:16:45Z

@mpiersonsmela

I've tested the pipeline, and my Nextflow report shows high RAM usage, particularly in the deduplication step. I'm not sure whether that is optimal, but I hope it helps.

sateeshperi · 2024-09-17T17:15:16Z

It's true that it requires 72 GB of memory: the process is labelled process_high, and the resources for that label are set in base.config.

I can limit the maximum memory for the test_full profile, but any other changes needed to match your available resources are ones you will have to make yourself, through config settings specific to your institutional cluster. Does that sound OK to you?
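One way to apply the cluster-specific settings mentioned above is a small custom config passed to Nextflow with `-c`. The following is a minimal sketch, not the maintainers' recommended fix. The file name `low_mem.config` is hypothetical, and the 32.GB ceiling is an assumption taken from the "avail: 32 GB" in the error message; nothing in this thread establishes that 32 GB is enough for Bismark genome preparation on a human-sized reference.

```shell
# Write a hypothetical custom config that overrides the memory request
# for every process carrying the process_high label.
# The 32.GB value is an assumption; adjust it to your node's capacity.
cat > low_mem.config <<'EOF'
process {
    withLabel: 'process_high' {
        memory = 32.GB
    }
}
EOF

# Then add the file to the original command with Nextflow's -c option, e.g.:
#   nextflow run nf-core/methylseq --input test_samplesheet.csv \
#       --outdir Output --fasta grch38_core+bs_controls.fa -c low_mem.config
```

A config supplied with `-c` takes precedence over the pipeline's own defaults, so only the `process_high` request changes and every other setting stays as shipped. If the step really does need more memory than the override allows, expect the job to fail at runtime instead of at submission.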

sateeshperi · 2024-10-27T14:09:18Z

Hi @mpiersonsmela, it’s true that the test_full profile needs 72 GB of RAM since we’re testing real-life samples. However, the test profile requires only 4 GB of RAM. So, if you’re just testing the pipeline setup, use the test profile. If you want to test with a real-sized dataset, you can try test_full, which does require high memory to process these samples.
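For reference, a run with the bundled test profile might look like the sketch below. The container profile (`docker`) and the output directory name (`Output_test`) are assumptions; substitute whatever your cluster supports, for example `singularity`.

```shell
# Profile list: the pipeline's test profile plus an assumed container engine.
PROFILE="test,docker"
OUTDIR="Output_test"   # hypothetical output directory

# Launch only if Nextflow is on PATH (on many clusters it must be loaded first).
if command -v nextflow >/dev/null 2>&1; then
    nextflow run nf-core/methylseq -profile "$PROFILE" --outdir "$OUTDIR"
else
    echo "nextflow not found on PATH" >&2
fi
```

The test profile is expected to supply its own small input data and reference, so `--input` and `--fasta` are left out here.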

If this answers your question, kindly close this issue. Thank you!

mpiersonsmela added the bug label Jul 26, 2024

sateeshperi added this to the 2.7.0 milestone Oct 20, 2024

sateeshperi removed the bug label Oct 20, 2024

sateeshperi linked a pull request Oct 20, 2024 that will close this issue: Dev -> Master 2.7.0 #420 (Merged)

sateeshperi mentioned this issue Oct 22, 2024 in Dev -> Master 2.7.0 #420 (Merged)

sateeshperi removed a link to a pull request Oct 22, 2024: Dev -> Master 2.7.0 #420 (Merged)

sateeshperi removed this from the 2.7.0 milestone Oct 22, 2024
