Implement alignment subworkflow #6

scwatts · 2024-03-12T08:30:40Z

No description provided.

…a single group record.

+ Put read_alignment and read_processing subworkflows into targeted workflow. + Uncomment the whole wgts worflow and integrate this with the read_alignment and read_processing subworkflows.

…ups stub.

* restrict target regions to canonical Ensembl transcripts

github-actions · 2024-03-12T08:32:45Z

`nf-core lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit e0da20a

+| ✅ 155 tests passed       |+
#| ❔   6 tests were ignored |#
!| ❗  45 tests had warnings |!

❗ Test warnings:

files_exist - File not found: assets/multiqc_config.yml
nextflow_config - Config manifest.version should end in dev: 0.3.1
readme - README contains the placeholder zenodo.XXXXXXX. This should be replaced with the zenodo doi (after the first release).
pipeline_todos - TODO string in test_full.config: Specify the paths to your full test data ( on nf-core/test-datasets or directly in repositories, e.g. SRA)
pipeline_todos - TODO string in test_full.config: Give any required params for the test so that command line flags are not needed
pipeline_todos - TODO string in output.md: Write this documentation describing your workflow's output
pipeline_todos - TODO string in usage.md: Add documentation about anything specific to running your pipeline. For general topics, please point to (and add to) the main nf-core website.
pipeline_todos - TODO string in awsfulltest.yml: You can customise AWS full pipeline tests as required
pipeline_todos - TODO string in main.nf: Optionally add in-text citation tools to this list.
pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!
pipeline_todos - TODO string in methods_description_template.yml: #Update the HTML below to your preferred methods description, e.g. add publication citation for this pipeline
system_exit - System.exit in main.nf: System.exit(1) [line 44]
system_exit - System.exit in main.nf: System.exit(1) [line 46]
system_exit - System.exit in Processes.groovy: System.exit(1) [line 33]
system_exit - System.exit in Processes.groovy: System.exit(1) [line 49]
system_exit - System.exit in WorkflowOncoanalyser.groovy: System.exit(1) [line 62]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 29]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 39]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 47]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 55]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 63]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 68]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 84]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 89]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 108]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 113]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 121]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 182]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 263]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 275]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 283]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 290]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 298]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 313]
system_exit - System.exit in Utils.groovy: System.exit(1) [line 344]
system_exit - System.exit in WorkflowMain.groovy: System.exit(1) [line 116]
system_exit - System.exit in WorkflowMain.groovy: System.exit(1) [line 123]
system_exit - System.exit in WorkflowMain.groovy: System.exit(1) [line 130]
system_exit - System.exit in WorkflowMain.groovy: System.exit(1) [line 144]
system_exit - System.exit in WorkflowMain.groovy: System.exit(1) [line 154]
system_exit - System.exit in WorkflowMain.groovy: System.exit(1) [line 159]
system_exit - System.exit in WorkflowMain.groovy: System.exit(1) [line 172]
system_exit - System.exit in WorkflowMain.groovy: System.exit(1) [line 188]
system_exit - System.exit in WorkflowMain.groovy: System.exit(1) [line 197]

❔ Tests ignored:

files_exist - File is ignored: lib/NfcoreTemplate.groovy
files_exist - File is ignored: lib/Utils.groovy
files_exist - File is ignored: lib/WorkflowMain.groovy
files_exist - File is ignored: lib/WorkflowOncoanalyser.groovy
actions_ci - actions_ci
multiqc_config - 'assets/multiqc_config.yml' not found

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .editorconfig
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/ci.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: assets/nf-core-oncoanalyser_logo_light.png
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/images/nf-core-oncoanalyser_logo_light.png
files_exist - File found: docs/images/nf-core-oncoanalyser_logo_dark.png
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: main.nf
files_exist - File found: conf/base.config
files_exist - File found: conf/igenomes.config
files_exist - File found: .github/workflows/awstest.yml
files_exist - File found: .github/workflows/awsfulltest.yml
files_exist - File found: modules.json
files_exist - File found: pyproject.toml
files_exist - File not found check: Singularity
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: pipeline_template.yml
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: docs/images/nf-core-oncoanalyser_logo.png
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: lib/nfcore_external_java_deps.jar
files_exist - File not found check: .travis.yml
nextflow_config - Config variable found: manifest.name
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: manifest.homePage
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: params.validationShowHiddenParams
nextflow_config - Config variable found: params.validationSchemaIgnoreParams
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config manifest.name began with nf-core/
nextflow_config - Config variable manifest.homePage began with https://github.com/nf-core/
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - Config params.custom_config_version is set to master
nextflow_config - Config params.custom_config_base is set to https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Lines for loading custom profiles found
nextflow_config - nextflow.config contains configuration profile test
nextflow_config - Config default value correct: params.force_genome= false
nextflow_config - Config default value correct: params.create_stub_placeholders= false
nextflow_config - Config default value correct: params.isofox_functions= TRANSCRIPT_COUNTS;ALT_SPLICE_JUNCTIONS;FUSIONS;RETAINED_INTRONS
nextflow_config - Config default value correct: params.custom_config_version= master
nextflow_config - Config default value correct: params.custom_config_base= https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Config default value correct: params.max_cpus= 16
nextflow_config - Config default value correct: params.max_memory= 128.GB
nextflow_config - Config default value correct: params.max_time= 240.h
nextflow_config - Config default value correct: params.publish_dir_mode= copy
nextflow_config - Config default value correct: params.validate_params= true
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - CODE_OF_CONDUCT.md matches the template
files_unchanged - LICENSE matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/CONTRIBUTING.md matches the template
files_unchanged - .github/ISSUE_TEMPLATE/bug_report.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/config.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/PULL_REQUEST_TEMPLATE.md matches the template
files_unchanged - .github/workflows/branch.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - .github/workflows/linting.yml matches the template
files_unchanged - assets/email_template.html matches the template
files_unchanged - assets/email_template.txt matches the template
files_unchanged - assets/sendmail_template.txt matches the template
files_unchanged - assets/nf-core-oncoanalyser_logo_light.png matches the template
files_unchanged - docs/images/nf-core-oncoanalyser_logo_light.png matches the template
files_unchanged - docs/images/nf-core-oncoanalyser_logo_dark.png matches the template
files_unchanged - docs/README.md matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
files_unchanged - pyproject.toml matches the template
actions_awstest - '.github/workflows/awstest.yml' is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml does not use -profile test
readme - README Nextflow minimum version badge matched config. Badge: 23.04.0, Config: 23.04.0
pipeline_name_conventions - Name adheres to nf-core convention
template_strings - Did not find any Jinja template strings (260 files)
schema_lint - Schema lint passed
schema_lint - Schema title + description lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
actions_schema_validation - Workflow validation passed: awstest.yml
actions_schema_validation - Workflow validation passed: branch.yml
actions_schema_validation - Workflow validation passed: ci.yml
actions_schema_validation - Workflow validation passed: clean-up.yml
actions_schema_validation - Workflow validation passed: fix-linting.yml
actions_schema_validation - Workflow validation passed: release-announcements.yml
actions_schema_validation - Workflow validation passed: linting.yml
actions_schema_validation - Workflow validation passed: awsfulltest.yml
actions_schema_validation - Workflow validation passed: linting_comment.yml
actions_schema_validation - Workflow validation passed: download_pipeline.yml
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'

Run details

nf-core/tools version 2.13.1
Run at 2024-04-23 05:48:52

scwatts · 2024-04-23T07:49:01Z

I've completed testing on the alignment subworkflow using simulated DNA/RNA reads representing four patients across WGTS data and targeted data:

subject_a (WGTS): tumor/normal DNA, tumor RNA
subject_b (WGTS): tumor/normal DNA, tumor RNA
subject_c (targeted, TSO500): tumor DNA, tumor RNA
subject_d (targeted, TSO500): tumor DNA, tumor RNA

Each subject where applicable contained:

somatic SV resulting in a reportable gene fusion
viral integration
one driver somatic small variant
one predisposing germline small variant
random somatic small variants
population heterozygous small variants (present in both tumor and normal)

NB: RNA data supported only the fusion event

Using this simulated data, I performed a total of 132 oncoanalyser runs that covered common input combinations drawing from the following options for each sample:

BAM, MarkDups duplicate marking
BAM, GATK4 duplicate marking
FASTQ, single lane + single library
FASTQ, single lane + multiple library
FASTQ, multiple lane + single library
FASTQ, multiple lane + multiple library

Combinations were additionally selected to cover all relevant analysis types. Each input combination was also tested against the following:

Sample number (single/multiple)
FASTQ splitting (default/disabled)
Mode (WGTS/targeted)

The outputs of each oncoanalyser run was compared to ensure expected features were present with identical metrics e.g. depth, location, fusion partners, etc (population hets were excluded from comparison). No differences in these comparisons were observed.

/cc @charlesshale @mkcmkc

charlesshale

Approved

scwatts and others added 30 commits December 1, 2023 18:33

Basic isolated alignment subworkflow outline

86220ec

Initial implementation of alignment workflow.

4e92135

Simplified condition on whether fastp is run in alignment subworkflow.

f69caf9

Get rid of blocking when merging individual sample records back into …

1ef8abd

…a single group record.

Simple improvement to the alignment subworkflow.

b2c2ad7

Merge alignment and markdups logic into Stephen's stubs.

f377861

Updgrading from bwa mem to bwa mem2.

0151004

Fixing read group flag for bwa mem2.

8456695

Reassigning TODO.

c3f9050

Emiting versions.

fe49cf0

Updating TODOs.

1d13b56

Fixing tags for new processes.

8ce78ff

Setting up targeted and wgts workflows for testing.

ee059ad

+ Put read_alignment and read_processing subworkflows into targeted workflow. + Uncomment the whole wgts worflow and integrate this with the read_alignment and read_processing subworkflows.

Minor fixes and style improvements.

e3f8d45

Adding a TODO.

58ed71c

Add has_umis switch to markdups.

de77de8

Force symlink overwrite so process does not fail on resume.

01ad68d

Change name of output bam from markdups.

df6e65a

Fix read group arg to bwa mem2.

a4a95b1

Add TODO.

61aa1b1

Fix markdups umi flags for TSO500 panel samples.

a1dde8e

Add TODO.

8f06a9f

Fix read group extraction from fastq filenames and a bug in the markd…

a2330b2

…ups stub.

Running with umis for targeted and without for wgts.

36c96fc

Fix includes in targeted.nf.

c8cec75

Only run markdups with UMIs when tso500 panel is selected.

b45b43a

Create switch between bwa mem and bwa mem2 for debugging purposes.

2479229

Add TODO.

00bf6ee

Move new params into nextflow.config.

3b83265

Add label to markdups process.

00cfe50

scwatts and others added 6 commits March 1, 2024 08:44

Use explicit returns in .branch ops

bd45719

Do not index RNA BAMs prior to merge

8d06484

Remove obsolete TODOs

c7e87c2

Fix Isofox singularity container URL

1653daf

Bump TSO500 data bundle version

f618283

* restrict target regions to canonical Ensembl transcripts

Remove -force_pathogenic_pass in PAVE somatic

2bd379c

scwatts added this to the Release 1.0.0 milestone Mar 12, 2024

scwatts added 16 commits March 16, 2024 17:24

Correct prepare reference panel data path lookup

f76f79b

Merge branch 'dev' into alignment-subworkflow

07117c2

Fix optional channel placeholders

9b012d0

Update modules.json

b90c572

Adjust indenting

77b8bce

Use standard container directive format for STAR

bd6b304

Add missing imports and subworkflow descriptions

c7f1774

Use Bioconda/BioContainers for bwa-mem2 module

177bf69

Improve naming for bwa-mem2 output BAMs

9d61690

Use BAM index created during alignment

066c8d2

Improve BAM index selection

0abc35f

Include BAI in bwa-mem2/align stub

eb95b4f

Adjust input selection logic

4039507

Bump MarkDups to 1.1.5

e8f6c99

Remove Sambamba index module file

ed8e1d1

Add new meta.yaml

e0da20a

scwatts marked this pull request as ready for review April 23, 2024 22:49

scwatts requested a review from charlesshale April 23, 2024 22:49

charlesshale approved these changes Apr 24, 2024

View reviewed changes

scwatts merged commit 101e987 into dev Apr 24, 2024
4 checks passed

scwatts deleted the alignment-subworkflow branch April 24, 2024 01:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement alignment subworkflow #6

Implement alignment subworkflow #6

scwatts commented Mar 12, 2024

github-actions bot commented Mar 12, 2024 •

edited

Loading

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

scwatts commented Apr 23, 2024

charlesshale left a comment

Implement alignment subworkflow #6

Implement alignment subworkflow #6

Conversation

scwatts commented Mar 12, 2024

github-actions bot commented Mar 12, 2024 • edited Loading

nf-core lint overall result: Passed ✅ ⚠️

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

scwatts commented Apr 23, 2024

charlesshale left a comment

Choose a reason for hiding this comment

github-actions bot commented Mar 12, 2024 •

edited

Loading

`nf-core lint` overall result: Passed ✅ ⚠️