Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

change input bam names of RunLOHHLA/QcQualimap/CollectHsMetrics #892

Open
anoronh4 opened this issue Apr 26, 2021 · 2 comments
Open

change input bam names of RunLOHHLA/QcQualimap/CollectHsMetrics #892

anoronh4 opened this issue Apr 26, 2021 · 2 comments
Assignees
Labels
backburner probably won't address in a near future bug Something isn't working

Comments

@anoronh4
Copy link
Collaborator

anoronh4 commented Apr 26, 2021

Recently we ran into a situation where input bams were not generated by Tempo and the ID was s_C_000184_T002_d but the basename of the bam was s_C_000184_T002_d___bqsr.bam, causing failures or recognition of an incorrect sample name (s_C_000184_T002_d___bqsr). We had failures in RunLOHHLA. Multiqc processes showed incorrect sample names because it would parse the path of the bam from the input picard and qualimap files.

LOHHLA does not have an option to enforce the ids we want. instead we can change the following line https://github.com/mskcc/tempo/blob/master/pipeline.nf#L1691 to:

set idNormal, target, idTumor, file("${idTumor}.bam"), file("${idTumor}.bam.bai"), file("${idNormal}.bam"), file("${idNormal}.bam.bai"), file(purityOut), placeHolder, file(winnersHla) from mergedChannelLOHHLA

multiqc uses regex to find and extract from the bam input of the hs_metrics file:
https://github.com/ewels/MultiQC/blob/master/multiqc/modules/picard/HsMetrics.py#L78-L82
same thing with qualimap:
https://github.com/ewels/MultiQC/blob/master/multiqc/modules/qualimap/QM_BamQC.py#L109

i think it makes sense to run each of the upstream modules with corrected bam names. an alternative would be to generate a clean temporary version of each input file at the multiqc process, which would work but not be as simple to implement.

@anoronh4 anoronh4 self-assigned this May 10, 2021
@anoronh4 anoronh4 added the bug Something isn't working label May 10, 2021
@anoronh4 anoronh4 added the backburner probably won't address in a near future label Sep 8, 2021
@gongyixiao
Copy link
Collaborator

I don't think changing the lines as you indicated will fix the problem. My idea will be have a rename command before running lohhla script in RunLOHHLA process to rename the bam file from whatever they are to ${idTumor}.bam etc. Happy to discuss.

@gongyixiao
Copy link
Collaborator

This might be a solution? #773

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backburner probably won't address in a near future bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants