Add option to output a new config file #63

KevinSayers · 2024-08-27T03:13:18Z

Issue #, if available:

Description of changes:
DRAFT: Need to add CWL and WDL formatting still.
Adds --config=<path> which if specified will create a configuration file formatted correctly for the workflow's engine. The below for example would be correctly formatted for NF. This would enable users to directly utilize recommended settings from the runanalyzer.

withName: TEST {
    cpu = 2
    memory = 4
}

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

markjschreiber

Added some suggestions

markjschreiber · 2024-08-30T14:05:04Z

omics/cli/run_analyzer/__main__.py

@@ -24,6 +25,7 @@
 -o, --out=<path>         Write output to file
 -P, --plot=<directory>   Plot a run timeline to a directory
 -H, --headroom=<float>   Adds a fractional buffer to the size of recommended memory and CPU. Values must be between 0.0 and 1.0.
+ -c, --config=<path>      Output a config file with recommended resources


specify that it is a Nextflow style config file

or Nextflow / CWL

markjschreiber · 2024-08-30T14:06:07Z

omics/cli/run_analyzer/__main__.py

+                """)
+                out.write(task_string)
+    elif engine == 'WDL':
+        pass


Raise an error saying we don't support WDL

markjschreiber · 2024-08-30T14:11:09Z

omics/cli/run_analyzer/__main__.py

+                    wfid = res['workflow'].split('/')[-1]
+                    engine = omics.get_workflow(id=wfid)['engine']
+                if res['type'] == 'task':
+                    task_name = res['name'].split(" ")[0]


Will this always work? Do we know that our engines can't produce a task name like "[my workflow task]" with spaces?

It might make this more future proof to have a constant for the task name split string because if we need to change it due to new engines, this line of code doesn't advertise it's intent. Alternatively, perhaps have a function to split the name, perhaps with an engine name as an argument if different engines do different things.

I will add a function that can parse the base task name and enforce the format of it.

Updates instructions for building from source

…labels (awslabs#65)

caveats on price estimation

KevinSayers · 2024-09-18T03:43:39Z

@markjschreiber I made this Nextflow only for now. I will add in CWL and WDL once I have spent more time working out how to handle scattered tasks for those.

markjschreiber

Looks good. Before merging, can you update the README with information about the --config option.

Other suggestions for improvements:
docopt has a nice mechanism for separating mutually exclusive tags (see my most recent PR and docopt documentation). If --config shouldn't be logically used with some other options then you can include a new usage line such as omics-run-analyzer <runId> --config=<path>. This can also simplify the logic for the main method as conflicting options can be rejected by the arg parser before they get to the main logic.

--write-config might be a better long name? --config might suggest you can read a config file to change the behavior of the run analyzer. Not sure?

Can you see if you can split the config logic into a separate python file. I have been trying to do this for new features to declutter the __main__.py which has gotten large and confusing.

markjschreiber

LGTM!

aws-ktsayers added 2 commits August 26, 2024 20:53

adding output of recommended config

bd9fc16

compare all tasks with the same name and keep the max resources

7138ded

markjschreiber reviewed Aug 30, 2024

View reviewed changes

markjschreiber and others added 6 commits September 16, 2024 21:25

Update README.md (awslabs#64)

8bb899c

Updates instructions for building from source

fixes label offsets and increases the threshold where we stop adding …

92b04f1

…labels (awslabs#65)

Update README.md (awslabs#66)

8ea9adc

caveats on price estimation

updating config

288157f

function to handle base task

2a76bdb

Limiting config to nextflow

440533b

KevinSayers marked this pull request as ready for review September 18, 2024 03:43

Merge branch 'main' into config

0c2724f

markjschreiber requested changes Sep 18, 2024

View reviewed changes

aws-ktsayers added 2 commits September 23, 2024 22:04

moving writeconfig to new file and adding tests

7ffb8ee

adding README details

b8f4b5a

markjschreiber approved these changes Sep 24, 2024

View reviewed changes

KevinSayers merged commit e2a8286 into awslabs:main Sep 24, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add option to output a new config file #63

Add option to output a new config file #63

KevinSayers commented Aug 27, 2024

markjschreiber left a comment

markjschreiber Aug 30, 2024

markjschreiber Aug 30, 2024

markjschreiber Aug 30, 2024

markjschreiber Aug 30, 2024

KevinSayers Sep 9, 2024

KevinSayers commented Sep 18, 2024

markjschreiber left a comment

markjschreiber left a comment

Add option to output a new config file #63

Add option to output a new config file #63

Conversation

KevinSayers commented Aug 27, 2024

markjschreiber left a comment

Choose a reason for hiding this comment

markjschreiber Aug 30, 2024

Choose a reason for hiding this comment

markjschreiber Aug 30, 2024

Choose a reason for hiding this comment

markjschreiber Aug 30, 2024

Choose a reason for hiding this comment

markjschreiber Aug 30, 2024

Choose a reason for hiding this comment

KevinSayers Sep 9, 2024

Choose a reason for hiding this comment

KevinSayers commented Sep 18, 2024

markjschreiber left a comment

Choose a reason for hiding this comment

markjschreiber left a comment

Choose a reason for hiding this comment