Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

No genotype information after variant calling using both single and multiple bam files. #10

Open
alaminzju opened this issue Mar 24, 2023 · 7 comments

Comments

@alaminzju
Copy link

Dear Giovanni,
Thank you for your nice software. I am using scAllele for the detection and analysis of variants using scRNA-seq. I wonder how you get the information about 0/1 or 1/1 in Figure 2E of your paper. I checked your deposited results in Zenodo (https://doi.org/10.5281/ zenodo.6558451), where I found no genotype information. Moreover, I have also analyzed my data and have yet to get genotype information. I have followed all QC processes and scAllele protocols to analyze my data. I am stuck at this point. Please kindly give your explanation about it.

Advance thanks for your response.

Best Regards,
Alamin
Email: [email protected]

@giovanniquinones
Copy link
Contributor

The genotype information was obtained from the reference vcf file (the one we are comparing the scAllele output to) which is the GIAB HG001 vcf.

@alaminzju
Copy link
Author

Can I use the GIAB HG001 vcf file for my analysis? If yes, please share the location of the vcf file.

@jmfa
Copy link

jmfa commented Jun 30, 2023

Hi Giovanni,
I have a follow-up question regarding this matter. Could you please provide any recommendations on how to perform genotyping of single cells once variants have been identified? (or does this fall outside the scope of scAllele?)

@giovanniquinones
Copy link
Contributor

scAllele takes a very conservative approach of not genotyping because allele-specific expression/bias can occur in the RNA-seq data resulting in wrong genotype calls. Currently, genotyping is out of the scope of scAllele but, since the allelic read counts and ratio values are given in the file, you could estimate the 'observed' genotypes.

@jmfa
Copy link

jmfa commented Jun 30, 2023

Thanks for the swift reply, @giovanniquinones.
Right, so perhaps one idea would be to only rely on variable sites (i.e., AC!=0) - since 0/0 genotypes are trickier to infer given the allele-expression bias.

@MohamedSuhail27
Copy link

MohamedSuhail27 commented May 9, 2024

Hello,
I tried using the github test data (https://github.com/gxiaolab/scAllele/tree/main) to perform the variant calling using scAllele v0.0.9.4. But I didn't get the vcf header in my output vcf file.

I used the following command for variant calling:
scAllele -b test_data/gm12878.chr21.bam -g test_data/hg38.chr21.fa -o scAllele_variant

Below, I have attached the output vcf for your reference.
scAllele_variant.zip

Could you able to help me in this.

Thanks.

@ttt-404
Copy link

ttt-404 commented Nov 4, 2024

Hello, I tried using the github test data (https://github.com/gxiaolab/scAllele/tree/main) to perform the variant calling using scAllele v0.0.9.4. But I didn't get the vcf header in my output vcf file.

I used the following command for variant calling: scAllele -b test_data/gm12878.chr21.bam -g test_data/hg38.chr21.fa -o scAllele_variant

Below, I have attached the output vcf for your reference. scAllele_variant.zip

Could you able to help me in this.

Thanks.

I met the same mistake, wondering how to sovle it

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants