Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Controlling CPU usage for the DRAM.py distill pipeline for DRAM v.1.5.0 #358

Open
nataliesarno opened this issue Aug 19, 2024 · 1 comment

Comments

@nataliesarno
Copy link

I am trying to control CPU usage for the DRAM.py distill pipeline on DRAM v.1.5.0 that I am using on DRAM.py annotate .tsv output from metagenome assembled genomes. As I am using a shared lab server with 128 available CPU threads and over 20 users I need to set a max CPU usage and provide a run-time estimate. However, there does not appear to be any flag to control the max number of threads that the DRAM.py distill pipeline uses. When I ran the following command and monitored CPU use DRAM.py distill used up to 20 CPU's when I tested the command on a single metagenome assembled genome DRAM.py annotate output .tsv file:

DRAM.py distill -i annotations.tsv -o DRAM_results_distilled --trna_path trnas.tsv --rrna_path rrnas.tsv

Is there any way to control CPU usage in the DRAM.py distill pipeline?

Thank you so much for your time and assistance in advance!

@madeline-scyphers
Copy link
Member

@nataliesarno DRAM v2 just got its first public beta release, and there are options to control the number of threads, memory allocation, SLURM queue process size (number of simultaneous SLURM jobs), all from the config or from CLI. The documentation is light right now with the first public beta release. We would love for you to give it a try, but if you want to wait, the documentation updates will be coming in the next couple weeks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants