Repetitive or low-quality barcode sequences in scATAC data #161

Open
jeremymsimon opened this issue Apr 24, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@jeremymsimon

Hi @haowenz,
I tend not to force use of the 10X barcode include-list, since barcodes outside it could still represent real cells or carry valuable information. However, I noticed that in some scATAC-seq data processed with chromap, a few "cells" pass all QC and even escape doublet discrimination (!), yet have a barcode sequence like GGGGGGGGGGGGTGGG or something similarly G-rich and low-complexity. This isn't a chromap bug per se, and it can obviously be fixed by requiring the identified cells to be in the include-list, but I do wonder whether barcodes like this could be flagged when their sequence entropy is exceedingly low. There were about 200 such "cells" out of ~70,000 in the dataset I'm currently working with, so they are rare, but frequent enough that they formed their own cluster in my data. Curious to hear your thoughts! Thanks!
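
To make the entropy idea concrete, here is a minimal sketch of one way such a flag could be computed. It is not chromap functionality: the function names and the 1.0-bit default cutoff are hypothetical, and the score is simply the Shannon entropy of the base composition of the barcode (0 bits for a homopolymer, 2 bits when A, C, G and T are equally frequent).

```python
import math
from collections import Counter


def barcode_entropy(barcode: str) -> float:
    """Shannon entropy, in bits per base, of a barcode's base composition."""
    n = len(barcode)
    counts = Counter(barcode.upper())
    # Sum -p * log2(p) over the bases that actually occur in the barcode.
    return sum(-(c / n) * math.log2(c / n) for c in counts.values())


def is_low_complexity(barcode: str, cutoff: float = 1.0) -> bool:
    """Flag a barcode whose composition entropy is below `cutoff`.

    The default cutoff is an illustrative assumption, not a value
    recommended by chromap or 10X.
    """
    return barcode_entropy(barcode) < cutoff


# The barcode quoted above: 15 G's and one T, about 0.34 bits per base.
print(round(barcode_entropy("GGGGGGGGGGGGTGGG"), 3), is_low_complexity("GGGGGGGGGGGGTGGG"))
```

Composition entropy only looks at base frequencies, so a short tandem repeat such as ACGTACGTACGTACGT still scores the maximum 2 bits and would need a different measure.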

@jeremymsimon jeremymsimon added the bug Something isn't working label Apr 24, 2024
@mourisl
Collaborator

mourisl commented Apr 24, 2024

I think this issue is better handled during downstream analysis. As you said, these barcodes are easy to identify, but the filtering cutoff (for example, on entropy) may need tuning. It would be more efficient to find appropriate cutoffs in the dataframe than to re-run Chromap.
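
As a rough illustration of tuning the cutoff downstream, the sketch below counts how many barcodes would be flagged at several candidate entropy cutoffs, without re-running the aligner. The helper names and cutoff values are hypothetical, and apart from the barcode quoted in the report the sequences are toy examples; in practice the barcodes would come from the cell metadata table.

```python
import math
from collections import Counter


def entropy_bits(seq: str) -> float:
    """Shannon entropy (bits per base) of the base composition of `seq`."""
    n = len(seq)
    return sum(-(c / n) * math.log2(c / n) for c in Counter(seq).values())


def flagged_per_cutoff(barcodes, cutoffs):
    """Return {cutoff: number of barcodes with entropy strictly below it}."""
    scores = [entropy_bits(b) for b in barcodes]  # score each barcode once
    return {c: sum(s < c for s in scores) for c in cutoffs}


# Toy input for illustration only; not taken from a real dataset.
barcodes = [
    "GGGGGGGGGGGGTGGG",  # the example from the report, about 0.34 bits
    "GGGGGGGGGGGGGGGG",  # homopolymer, 0 bits
    "ACGTTGCAAGCTTGAC",  # four bases equally frequent, 2 bits
    "AAGGAAGGAAGGAAGG",  # two bases equally frequent, 1 bit
]
result = flagged_per_cutoff(barcodes, [0.5, 1.0, 1.5])
print(result)
```

Comparing these counts with the size of the suspicious cluster is one way to pick a cutoff that removes the artifacts without discarding ordinary barcodes.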
