What fraction of cells per celltype is recommended to use as subsample_size? #26

Open
kaizen89 opened this issue Jun 7, 2023 · 2 comments

Comments

@kaizen89

kaizen89 commented Jun 7, 2023

Hi!
Thanks a lot for this very useful tool.
I successfully ran Augur on a large dataset (~70K cells) using default parameters. I want to increase subsample_size to 1000-2000 cells to better capture the heterogeneity. I was wondering if you can give any recommendation about the fraction of cells per cell type and per condition that you think would be reasonable to use.
Thank you!

@skinnider
Collaborator

I essentially never change this parameter from its default values, but doing so should not impact your results very much. In Supplementary Fig. 6 of the Augur paper, we show that prioritizations are quite robust to the value of subsample_size - the main risk is removing cell types with too few cells from consideration.
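
To gauge the risk mentioned above before raising subsample_size, one could count cells per cell type and condition and see which cell types would fall short. The following is a minimal Python sketch, not part of Augur itself. It assumes a cell type is dropped when any condition has fewer than subsample_size cells, and the function name `cell_types_at_risk` and the toy labels are hypothetical.

```python
from collections import Counter


def cell_types_at_risk(cell_types, conditions, subsample_size):
    """Map each cell type to its smallest per-condition cell count,
    keeping only those where that count is below subsample_size.

    Assumption (not taken from the Augur source): a cell type is
    excluded when any condition has fewer than subsample_size cells.
    """
    counts = Counter(zip(cell_types, conditions))
    all_conditions = set(conditions)
    at_risk = {}
    for cell_type in sorted(set(cell_types)):
        # A missing (cell type, condition) pair counts as zero cells.
        smallest = min(counts.get((cell_type, c), 0) for c in all_conditions)
        if smallest < subsample_size:
            at_risk[cell_type] = smallest
    return at_risk


# Hypothetical per-cell labels: 3000 T cells, 1500 B cells, 400 NK cells.
cell_types = ["T"] * 3000 + ["B"] * 1500 + ["NK"] * 400
conditions = (
    ["ctrl"] * 1500 + ["stim"] * 1500   # T
    + ["ctrl"] * 900 + ["stim"] * 600   # B
    + ["ctrl"] * 250 + ["stim"] * 150   # NK
)

for size in (20, 1000):
    print(size, cell_types_at_risk(cell_types, conditions, size))
```

With these toy counts, nothing is flagged at a subsample size of 20, while B and NK cells are flagged at 1000 because their smallest condition has only 600 and 150 cells respectively.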

@kaizen89
Author

kaizen89 commented Jun 7, 2023

I guess you are referring to Supp. Fig. 7 (not 6). I saw this part, but unfortunately the dataset tested seems a bit small, and it's difficult to conclude with subsample_size 50-100.
