You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have been looking through the source code and I would like some clarifications.
I am unable to determine whether or not the software makes any assumptions about sample names in the VCF? It doesn't appear to really care about the GT columns in the VCF, but I wanted to confirm this. I am mostly concerned that the sample ID in my bam files aren't the same as the ones in my VCF (just TUMOR/NORMAL in my vcf).
In your docs it looks like the order of the disease and normal bam is different than what the command-line help prompt states for the extract observations part.
Edit:
One more question. Does prosic consider soft-clipped bases?
Thanks!
The text was updated successfully, but these errors were encountered:
Hi,
thanks for your interest in prosic! Regarding your questions:
indeed, the whole sample columns in the vcf are not considered. With extract observations, the first bam is expected to be the healthy sample, the second the cancer sample. extract-observations then extracts evidence from the bams for each entry in the vcf, but it only considers the position, ref, alt and info columns. The calling then recalculates the posteriors for the variants.
Indeed, that was a mistake. Thanks for pointing out!
I should note that we are currently rewriting prosic into a much faster and more powerful and solid version (see here). It should be ready in at most 1 or 2 months.
I have been looking through the source code and I would like some clarifications.
Edit:
One more question. Does prosic consider soft-clipped bases?
Thanks!
The text was updated successfully, but these errors were encountered: