"Number of mapped reads" from log file #144
Comments
For the "Number of mapped reads", I believe it only counts barcode-valid reads (barcode in the whitelist, or corrected to one). Q1: If you need the number of unfiltered mapped reads, "Number of mapped reads" is the place to look. What number do you have in mind?
Thanks for your responses! Sorry, but I'm not sure I fully understand what you mean. What do you mean by "unfiltered" mapped reads? I would like to know the number of mapped reads regardless of whether the reads have a valid cell barcode or not. I am trying to figure out why I started with 153,220,788 reads but ended up with only 30,174,748, lol.

The reason I am confused is that, for the same sample, I also did a bulk mapping with Bowtie2. As you can see below, the mapping rate is okay, with an 86.75% overall alignment rate and a 56% unique mapping rate (Bowtie2 counts each paired-end fragment once, so its read count is half of Chromap's, but both were run on the same FASTQ files). For Chromap, however, even before deduplication the ratio is 37,926,511/153,220,788 = 24.7%. So I want to know at which step I am losing reads.

Hopefully I have explained my questions clearly. Thank you very much for your help in advance.

bulk mapping summary using
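The arithmetic in this comment can be laid out stage by stage. The following is only a small Python sketch using the three read counts quoted above; the stage labels and the `retention` helper are illustrative names, not Chromap log fields.

```python
# Read counts quoted in the comment above.
total_reads = 153_220_788          # reads given to Chromap
mapped_before_dedup = 37_926_511   # mapped reads before deduplication
final_reads = 30_174_748           # reads remaining at the end


def retention(part: int, whole: int) -> float:
    """Return part / whole as a fraction between 0 and 1."""
    return part / whole


# Stage labels are illustrative, not taken from the Chromap log.
stages = [
    ("input reads", total_reads),
    ("mapped, before dedup", mapped_before_dedup),
    ("final", final_reads),
]

for name, count in stages:
    share = retention(count, total_reads)
    print(f"{name:<22}{count:>14,}  {share:7.2%} of input")

# Share of mapped reads removed by the steps after mapping.
lost_after_mapping = 1 - retention(final_reads, mapped_before_dedup)
print(f"lost after mapping: {lost_after_mapping:.1%}")
```

Laid out this way, most of the loss happens before or during mapping (about three quarters of the input never reaches the "mapped" stage), which is the step the question is really about.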
Reads with an invalid barcode are not mapped, so the mapped read count does not include them.
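Since reads with invalid barcodes are skipped before mapping, one way to estimate how many reads are lost at that step is to count exact whitelist matches in the barcode FASTQ yourself. Below is a minimal Python sketch, independent of Chromap. It does not perform barcode correction, and it assumes the barcodes in the FASTQ are in the same orientation as the whitelist, so under those assumptions it should only undercount what Chromap treats as valid. The function names and the tiny in-memory example are hypothetical.

```python
def read_fastq_sequences(lines):
    """Yield the sequence line (2nd of every 4 lines) from FASTQ text lines."""
    for i, line in enumerate(lines):
        if i % 4 == 1:
            yield line.strip()


def count_whitelisted(barcodes, whitelist):
    """Return (exact_matches, total) for an iterable of barcode strings."""
    allowed = set(whitelist)
    total = 0
    matched = 0
    for barcode in barcodes:
        total += 1
        if barcode in allowed:
            matched += 1
    return matched, total


# Tiny in-memory example standing in for a real barcode FASTQ and whitelist.
fastq_lines = [
    "@r1", "AAAC", "+", "IIII",
    "@r2", "TTTG", "+", "IIII",
    "@r3", "AAAC", "+", "IIII",
]
whitelist = ["AAAC", "CCCG"]

matched, total = count_whitelisted(read_fastq_sequences(fastq_lines), whitelist)
print(f"{matched}/{total} reads carry an exact whitelist barcode")
```

For real data, the same functions can be fed an open file handle (for example from `gzip.open(path, "rt")`) instead of the in-memory list. If the exact-match fraction is very low, the barcode orientation or the choice of whitelist is worth checking.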
Hello Chromap Team,
Thank you very much for actively maintaining Chromap!
I recently used Chromap to map scATAC-seq data with a barcode whitelist, and I found the log file a bit confusing. As stated in the documentation, when barcodes and a whitelist are given as input, Chromap will, by default, estimate barcode abundance and perform barcode correction.
I am looking to understand the following QC numbers from the log file:
In relation to these questions:
For Q1, should I refer to the "Number of mapped reads" in the log file?
For Q2, what does "Number of barcodes in whitelist" represent? Does it indicate the number of barcodes, or the number of reads with the whitelisted barcodes?
These metrics are very useful for my experimental debugging, and I would greatly appreciate your clarification.