-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bam_stats fails to give informative error on BAMs with no RG lines/tags #66
Comments
Hi,
|
Please run the command I provided to generate the file and attach it to the issue (drag'n drop), the long comment has been deleted. |
Github cannot support the BAM file as an attachment, is there another way I can share the file? |
I have reviewed your file (via email). Your BAM file is invalid using PHREAD+64 encoding for the quality scores. Note all your reads have a run of
You can find more information on phread encoding here: |
Correspondence with new BAM
|
This appears to be related to a lack of readgroup header (and aux tag). The code is failing when attempting to handle the absence of the sample information.
Lines 108 to 110 in 79f22b6
@davidrajones can you take a look please? Bam attached with artificial @leovam you can work around this by defining a read group header and tagging the records with the ID. Please note that the cgpmap process is just a CWL wrapper around this repository (pcap-core). The methodology you use for mapping can be achieved using |
Thanks for the efforts to provide such information. I have a general question. Is I use BWA-MEM directly to do the alignment instead of using "by defining a read group header and tagging the records with the ID" means that we can actually work with the current BAM file and add |
The You can pass a RG entry into many mapping tools, but you can also apply it on a stream using
Each lane of sequencing should be processed independently and then merged after the RG+sorting has been applied P.S. I noticed in your example file that you have the core reference sequences and the targeted genes declared in the header. This is not something I would expect to see (certainly in short-read) and may result in your reads having very low mapping scores as they may be considered non-unique mappings. |
Hi,
I have been trying to use cgpwgs to call the variants on my own BAM files. I saw it said the BAM files are expected to have been mapped using cgpmap. We did not use the cgpmap to get the BAM files but got them with the common routine: 1) align the pair-end fastq files to hg38 using BWA MEM to get the SAM files, 2) convert SAM to BAM and 3) sort and index the BAM files with SAMTOOLS.
then we used bam_stats in the dockstore-cgpwgs to get the bas file for the sorted BAM files. the script was like:
however, the program finsied immediately after hittign "enter" and no bas files produced at all and there is no error message so we did not know what went wrong. We also test the script on the example BAM files and it will give us the corresponding bas files in about 1 minute.
We wonder:
Thanks!
The text was updated successfully, but these errors were encountered: