-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
chimeric_alignments error during infer_breakpoint_graph #7
Comments
@Tina04021997
|
Hi Team CoRAL, I have encountered almost similar issues as well while running reconstruct mode. I am using Nanopore reads aligned with NGMLR (which is in contrast to minimap2 mentioned in the article). The log file (error message attached) shows some key error corresponding to parsing the CIGAR string in bam files. `Performing reconstruction with options: Traceback (most recent call last): `[root:INFO] Python version 3.7.12 | packaged by conda-forge | (default, Oct 26 2021, 06:08:21) [root:INFO] #TIME 0.0046 Commandline: /data/cephfs-1/work/projects/ecdnapanc/CoRAL/src/CoRAL.py reconstruct --lr_bam /data/cephfs-1/work/projects/ecdnapanc/carcinoma_celllines/Panc1_20230711_sorted.bam --cnv_seed /data/cephfs-1/work/projects/ecdnapanc/CoRAL/outputs/seed_mode/Panc1_20230711_sorted_CNV_SEEDS.bed --output_prefix /data/cephfs-1/work/projects/ecdnapanc/CoRAL/outputs/reconstruct_mode/Panc1_20230711_sorted --cn_seg /data/cephfs-1/work/projects/ecdnapanc/carcinoma_celllines/cnvkit/calls/Panc1_20230711_sorted.call.cns Could you please guide how to solve this issue? |
Hi @Jenny-Tamboli, I think it could only be used with winnowmap. We aligned our data with winnowmap and printed different variables of the script infer_breakpoint_graph.py. With winnowmap, the read's name is printed (column 1) but not with pbmm2, maybe because of how winnowmap defines the CIGAR string. I have also tried this program by providing the seed and bed files and it also produces errors. |
Hi @MJoseMo, Thank you very much for the swift response. I will try using winnowmap. But if I understood it correctly, despite using winnowmap, you still got errors. Is it? |
Hi all, Thanks so much for using CoRAL and posting this issue! Though in our study we used minimap2 for all alignments, there's no reason that alternative aligners will not work (such as winnowmap or NGMLR). I think that one issue could be the MarkDuplicates step - if you run CoRAL without this last filter (i.e., just alignment & sorting), does this fix the issue? If I understand your workflow correctly, you did not do any PCR amplifications I'm not sure marking/removing duplicates is necessary for this analysis. Apologies if I misunderstood something. Best, |
Hi, |
Hi, I got similar issue when using NGMLR (0.2.7) aligner. However with minimap2 (2.28-r1209) it worked. This is I think because for the |
Hi everyone, We have just released a testing version for CoRAL that supports both PacBio data and bams produced by aligners that make alternately formatted CIGARs (e.g. NGMLR). We would be greatful if you would be able to give it a try with the data you encountered that has issues. Please let me know if this gives errors for you and if so which aligner & long read technology you are using.
Thank you! |
Thank you all for developing CoRAL. I have a question regarding the error message during infer_breakpoint_graph step and wonder if there are any suggestions you could kindly have:
Error message:
Traceback (most recent call last):
File "/tscc/nfs/home/tiy002/software/CoRAL/src/infer_breakpoint_graph.py", line 1637, in
b2bn.find_amplicon_intervals()
File "/tscc/nfs/home/tiy002/software/CoRAL/src/infer_breakpoint_graph.py", line 219, in find_amplicon_intervals
self.find_interval_i(ai, ccid)
File "/tscc/nfs/home/tiy002/software/CoRAL/src/infer_breakpoint_graph.py", line 354, in find_interval_i
if i in self.chimeric_alignments_seg[chr]:
KeyError: 'chr1'
The last few lines of the log file:
[root:DEBUG] #TIME 332.1212 Updated amplicon interval ['chr12', 56855263, 57175262, -1]
[root:DEBUG] #TIME 332.1212 Updated amplicon interval ['chr12', 57555264, 57950263, -1]
[root:DEBUG] #TIME 332.1212 Updated amplicon interval ['chr12', 68545270, 69760270, -1]
[root:DEBUG] #TIME 332.1213 Updated amplicon interval ['chr12', 110075295, 110390294, -1]
[root:DEBUG] #TIME 332.1213 Begin processing amplicon interval 0
[root:DEBUG] #TIME 332.1213 Amplicon interval ['chr1', 766225, 1086326, -1]
[root:DEBUG] #TIME 332.1213 Start BFS on amplicon interval 0.
[root:DEBUG] #TIME 332.1213 BFS queue: [0]
[root:DEBUG] #TIME 332.1213 Next amplicon interval 0: ['chr1', 766225, 1086326, -1].
[root:DEBUG] #TIME 332.1214 Reset connected component ID to 0
CoRAL Command:
python3 /tscc/nfs/home/tiy002/software/CoRAL/src/infer_breakpoint_graph.py --lr_bam $PATH/TR14.mkdup.bam --seed $PATH/TR14_CNV_SEEDS.bed --output_prefix $PATH/TR14 --cnseg $PATH/TR14.mkdup.cns --log_fn TR14_trace.log
CNVkit command:
cnvkit.py batch -m wgs -r /tscc/nfs/home/tiy002/data_repo/GRCh38/GRCh38_cnvkit_filtered_ref.cnn -p 16 -d $PATH/cnvkit $PATH/TR14.mkdup.bam
Additional info:
a) BAM to FASTQ
b) FASTQ alignment to BAM (PacBio CCS mode)
c) Sort and index and perform mark duplicate
The text was updated successfully, but these errors were encountered: